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Abstract: We survey a number of models from physics, statistical me- 
chanics, probability theory and combinatorics, which are each described in 
terms of an orthogonal polynomial ensemble. The most prominent exam- 
ple is apparently the Hermite ensemble, the eigenvalue distribution of the 
Gaussian Unitary Ensemble (GUE), and other well-known ensembles known 
in random matrix theory like the Laguerre ensemble for the spectrum of 
Wishart matrices. In recent years, a number of further interesting models 
were found to lead to orthogonal polynomial ensembles, among which the 
corner growth model, directed last passage percolation, the PNG droplet, 
non-colliding random processes, the length of the longest increasing subse- 
quence of a random permutation, and others. 

Much attention has been paid to universal classes of asymptotic behav- 
iors of these models in the limit of large particle numbers, in particular the 
spacings between the particles and the fluctuation behavior of the largest 
particle. Computer simulations suggest that the connections go even far- 
ther and also comprise the zeros of the Riemann zeta function. The existing 
proofs require a substantial technical machinery and heavy tools from var- 
ious parts of mathematics, in particular complex analysis, combinatorics 
and variational analysis. Particularly in the last decade, a number of fine 
results have been achieved, but it is obvious that a comprehensive and 
thorough understanding of the matter is still lacking. Hence, it seems an 
appropriate time to provide a surveying text on this research area. 

In the present text, we introduce various models, explain the questions 
and problems, and point out the relations between the models. Further- 
more, we concisely outline some elements of the proofs of some of the most 
important results. This text is aimed at non-experts with strong background 
in probability who want to achieve a quick survey over the field. 
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1. Introduction 

In the 1950ies, it was found that certain important real A^-particle ensembles 
(that is, joint distributions of N real random objects) can be described by a 

*This is an original survey paper 
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probability measure Pat of the form 
1 ^ 

FN(dx) = —AN{xf Y\ fi{dx^), x = {xi,...,XN)eWN, (1.1) 

i—l 

on the set 

Wn = {x = (xi, . . .,xn) ■■ Xi < X2 < ■ ■ ■ < Xn}, (1.2) 

where is the normaUzation, fi some distribution on M having aU moments, 
and 

An{x)^ n (^^i - 2;^) = det[(xp^)ij=i,...,7v], a: = (ii, . . . , Xat), 

l<i<j<A' 

(1-3) 

is the weh-known Vandermonde determinant. That is, P^v is the transformed 
configuration distribution of a vector of N particles, distributed independently 
according to /i under the influence of the mutually repelling density , prop- 
erly normalized to a probability measure on the so-called Weyl chamber Wn- 
The most important and one of the earliest examples is the joint distribution 
of the eigenvalues of a random matrix drawn from a Gaussian Unitary Ensem- 
ble (GUE), in which case /z is a Gaussian distribution, and P^v is called the 
Hermite ensemble. Also spectra of a couple of other types of random matrices 
turned out to admit a description of the form among which the Wishart 

matrices, where /i is a Gamma distribution and P^v the Laguerre ensemble. The 
explicit form of Hl.l|l served as a starting point for many deep investigations of 
asymptotic spectral properties of random matrices. Furthermore, non-colliding 
Brownian motions (sometimes called Dyson's Brownian motions) could also suc- 
cessfully be investigated in the early 1960ies using descriptions in the spirit of 
p.lfl . Also variants of (e.g., with replaced by A^v or by Aj^) turned 

out to have a significant relevance and could be treated using related methods. 

For a long while, spectra distributions of certain random matrices (and the 
closely related non-colliding Brownian motions) were the only known important 
models that admit a description as in Hl.l|l . However, in the second half of the 
1990ies, the interest in non-colliding random processes was renewed and was put 
on a more systematic basis, and other types of statistical physics models were 
found to admit a description of the form certain random growth models 

(equivalently, directed last passage percolation), polynuclear growth models, 
the problem of the length of the longest increasing subsequence in a random 
permutation, the Aztec diamond, and others. Furthermore, effective analytic 
techniques for deriving asymptotic properties of Pat, which were developed in 
the early 1990ies, have recently been systematically extended and improved. 
As a consequence, in recent years a lot of questions about these models could 
be answered. The last ten years saw an exploding activity of research and an 
enormous progress in the rigorous understanding of some of the most important 
of these models, and the work is still going on with an increasing velocity. A 
significant number of deep and important results on universality questions have 
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recently been solved, building on work of the last 40 or so years. However, it 
still seems as if a complete understanding of the phenomena has not yet been 
achieved, since many of the existing proofs are still based on explicit calculations 
and direct arguments. There seem some intricate mechanisms present which 
have been understood only in special cases by formal analogies. It will be an 
important and difhcult task in future to find the essences of the phenomena in 
general. 

In view of the great recent achievements, and also in order to draw the at- 
tention of non-experts to this field, it seems fruitful to write a comprehensive 
survey on most of the models that can be described by an ensemble as in 1)1. l|l . 
The present text is an attempt to explain the problems and questions of interest 
in a unifying manner, to present solutions that have been found, to give a flavor 
of the methods that have been used, and to provide useful guidelines to much of 
the relevant literature. It is aimed at the non-expert, the newcomer to the field, 
with a profound background in probability theory, who seeks a non-technical 
introduction, heuristic explanations, and a survey. Our concern is to compre- 
hensively summarize the (in our opinion) most important available results and 
ideas, but not to lose ourselves in details or even technicalities. In the three 
remaining sections, we give an account on the three research areas we consider 
most important in connection with orthogonal polynomial ensembles: random 
matrix theory, random growth models, and non-colliding random processes. 

A probability measure Pat of the form 1)1.1(1 is called an orthogonal polyno- 
mial ensemble. The theory of orthogonal polynomials is a classical subject, and 
appears in various parts of mathematics, like numerics, combinatorics, statistics 
and others. The standard reference on orthogonal polynomials is |Sz75| . How- 
ever, the term 'orthogonal polynomial ensemble' is relatively recent and may be 
motivated by the following. Let (TrAr)^?^^^ denote the sequence of polynomials 
orthogonal with respect to the inner product on the space (fi) . The polynomi- 
als are unique by the requirement that the degree of ttat is TV, together with the 
normalization TrN{x) = x^ -\- 0{x^~^). They may be obtained from the mono- 
mials x I— > x^ via the well-known Gram-Schmidt algorithm. A nice relation^ 
between the orthogonal polynomials and the ensemble Pat in 1)1. l|l now is the 
fact that TTjsf may be seen as the 'expected polynomial' of the form Y[i=i (x — Xi) 
with {xi, . . . ,xn) distributed according to Pat, i.e., 



2. Random matrix theory 

In spite of the appearance of various random matrix distributions in several 
areas of mathematics and physics, it has become common to use the term ran- 
dom matrix theory exclusively for those matrix distributions that are used, since 

^Further connections will be exploited in Section i'2 . 71 below. 




N 



(1.4) 



i=l 
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Wigner's introduction to physics in the 1950ies, as models for energy levels in 
slow nuclear reactions. Measurements had already given rise to a hope that the 
energy levels follow a universal picture. Wigner's hope was that the eigenvalues 
of appropriate classes of random matrices would be mathematically tractable 
and would reflect this universality in the limit of unbounded matrix size. Based 
on Wigner's early work, Dyson |Dy62a| , |Dy62c| argued on physical grounds 
that three certain matrix classes be relevant for the description of energy lev- 
els, the by now famous orthogonal, unitary and symplectic Gaussian ensembles. 
It soon turned out that their eigenvalue distributions are given in terms of cer- 
tain orthogonal polynomial ensembles. In the mid-nineties, seven more Gaussian 
random matrix ensembles were introduced |\^ . |AZ96j . jAZ97j . |Zi97] . and it 
was argued that these in total ten classes form a complete classification of the 
set of random matrix ensembles that are physically relevant in a certain sense, 
subject to some symmetry constraints. 

In the last decades, random matrix theory became a major mathematical and 
physical research topic, and more and more exciting phenomena were found. In 
the last decade, universality of many aspects could be proven for large classes 
of models, and the research is going on increasingly fast. 

The standard reference on the mathematical treatment of random matrices is 
|Me91j . Authored by a physicist with strong mathematical interest, it explains 
the physical relevance of a host of random matrix models and provides a great 
amount of relevant formulas and calculations. A recent historical survey on the 
field from a physicist's point of view in FS V03| (see the entire volume), which 
contains a vast list of references, mostly from the physics literature. A thorough 
summary of the proofs of some of the most important results on random matrix 
theory from the viewpoint of Riemann-Hilbert theory is in pe98]. F urther sur- 
veying and appetizing texts on random matrix theory are |TW93b| and |Di03| . 
When the present text is being written, some (teams of) authors are preparing 
monographs on random matrix theory, among which IFoOS-H . 

In the present section we first introduce to some of the above mentioned 
matrix ensembles and their eigenvalue distributions in Sections l2.1H2.4l present 
the famous Wigner semicircle law in Section 12.51 discuss correlation functions 
in Section 12.61 and introduce the important method of orthogonal polynomials 
in Section 12.71 Afterwards, we present the most important asymptotic results 
on eigenvalues, the bulk limits in Section 12.81 and the edge asymptotics in Sec- 
tion 12.91 The main proof method, the Riemann-Hilbert theory, is outlined in 
Section [2.101 Finally, in Section [2. Ill wc explain some relations to the zeros of 
the famous Riemann zeta function. 

2.1 The questions under interest 

Consider a random Hermitian N x A^-matrix, M, and denote its eigenvalues 
by Ai < A2 < • • • < Xn- Hence, A — (Ai,...,AAr) is a random element of 
the closure of the Weyl chamber Wn in (|1.2(l . Among others, we shall ask the 
following questions: 

(i) What is the distribution of A for fixed N eNl 
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(ii) What is the hmiting scaled distribution of A as — > oo, in terms of the 
empirical measure ^'x ' ^-"^ appropriate scaling of Ai? 

(iii) What is the limiting behavior of the largest eigenvalue, Aat, as ^ oo? 
(Or of the smallest, Ai, or the joint distribution of a few of the smallest, say 
(Ai, . . . , Am) for some m.) More precisely, what is the right normalization 
for a law of large numbers, and what is the right scaling for a limit law, if 
present? 

(iv) What are the limiting statistics of the spacings between neighboring eigen- 
values? How many gaps are there with a given maximal length? What is 
the average distance between Xcn-tn ^-iid ^cN+tn for some c G (0, 1) and 
some r jsf oo such that ri^/N oo7 

Question (iii) refers to the edge of the spectrum, while (iv) refers to the bulk 
of the spectrum. 

The so-called Wigner surmise conjectures that the limiting spacing between 
two subsequent eigenvalues of a large Gaussian matrix should have the density 
(0, oo) 3x1-^ Cxe~'^^ . This is true for a (2 x 2)-matrix ( ^ ' ) with independent 
standard Gaussian entries a,b,c: The spacing A2 — Ai is equal to [(a — c)^ -I- 
46^]^/^, whose square has the x^-distribution. However, the Wigner surmise 
turned out to be inaccurate (even though rather close to the true distribution) : 
the asymptotic spacing distribution is different. 

2.2 Matrix distributions 

It turned out ^Dy62a^ that, according to time reversal invariance properties of 
the material considered, basically three different matrix classes are of interest as 
models for energy levels of nuclea: matrices whose entries are (1) real numbers, 
(2) complex numbers, and (3) quaternions. One basic requirement is that the 
random matrices considered be symmetric, respectively Hermitian, respectively 
self-dual, such that all the eigenvalues are real numbers. For the (famous and 
most studied) special case of Gaussian entries, these three cases correspond 
to the Gaussian Orthogonal Ensemble (GOE), the Gaussian Unitary Ensemble 
(GUE) and the Gaussian Simplectic Ensemble (GSE). In the following, we shall 
concentrate mostly on the unitary ensemble, since this class is, in some respects, 
technically the easiest to treat and exhibits the farthest reaching connections to 
other models. 

We assume that M = (Mij )ij=i....^7v is a random Hermitian {N x iV)-matrix 
with complex entries. In particular, the diagonal entries Mi^i are real, and for 
i^j,we have M,,, = M^^ + i M™ = M^y - i Mj)] = M,- where M^y and 
Mj'l are the real part and imaginary part, respectively. 

Two basic respective requirements are (1) independence of the matrix entries, 
and (2) invariance of the distribution of the matrix under unitary conjugations. 
These two ideas lead to different matrix classes: 

Wigner matrices: We call the random Hermitian matrix M a Wigner matrix 
if the collection {M™ : i,j = l,...,N,i < j} U {M^'j : i,j = l,...,N,i < 
j} U {Mi^i: i — 1,...,N} consists of independent, not necessarily identically 
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distributed, random variables with mean zero and a fixed positive variance, which 
is the same for the real parts and for the imaginary parts, respectively. 

Hence, there are TV^ independent real variables that determine the distribu- 
tion of M. The distribution of the diagonal elements is arbitrary, subject to 
moment conditions. 

Unitary-invariant matrices: We call the random Hermitian matrix M unitary- 
invariant if the joint distribution of its entries has the form 

N 

P(dM) = const. e"^(*^) dM,,, [| [dA-^^' dM™] = const, g-^^*^' dM, 

i=l l<!<j<Ar 

(2.1) 

for some function F, and, for every unitary matrix U , P(dM) is invariant under 
conjugation with U . 

The last requirement implies that e'^'^'^^-^'^''^ d[UMU-^ = e^-^f*^) dM, 
and since it is easy to see that d[UMU^^ — dM De98, p. 92], the function 
F must be a symmetric function of the eigenvalues of M. One particular and 
important example is the case where 

F{M) = Tr(g(Af )), Q{x) = 72^0;^^' + • • • + 70, a polynomial with 72^ > 0. 

(2.2) 

With the exception of the Gaussian case j = 1, there are strong correlations 
between all the matrix entries. The idea behind the invariance under unitary 
conjugations is that the matrix distribution should not depend on the observa- 
tion system, as long as it is based on a unitary coordinate axis. 

The famous GUE lies in the intersection of the Wigner-class and the unitary- 
invariant class. It is a Wigner matrix with all the sub-diagonal entries being 
complex standard normal variables^ and the diagonal entries being a real normal 
variable with variance two^. Alternately, it is the unitary- invariant matrix of the 
form ini) with F{M) = Tr{M^). 

The GOE is the real variant of the GUE; i.e., the sub-diagonal entries are 
independent standard real normal variables with the same variance as the di- 
agonal entries. Hence, the GOE has ^N{N + 1) independent sources of real 
randomness. 

The GSE is the symplectic variant of the GUE, i.e., the diagonal entries are 
real standard normals as in the GUE, and the sub-diagonal entries are elements 
of the quaternion numbers. Their four components are i.i.d. real standard normal 
variables. Hence, the GSE has N + 2N{N — 1) independent real randomnesses. 

Further important related classes of random matrices are the Wishart ma- 
trices, which are of the form A* A with A a (not necessarily square) matrix 

^By this we mean that the real and the imaginary part are two independent standard 
normal variables. 

^Some authors require the sum of the variances per entry to be equal to one, or equal to 
l/N. 
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having throughout i.i.d. complex normal entries (first considered in a multivari- 
ate statistics context by Wishart |Wi28p . See |Me91j for further classes. 

2.3 Eigenvalue distributions 

Let Ai < A2 < • • • < Aat be the TV eigenvalues of the random Hermitian matrix 
M. We ask for the distribution of the random vector A = (Ai, . . . , Aat). A concise 
answer for a general Wigner matrix M seems inaccessible, but for unitary- 
invariant ensembles there is a nice, fundamental formula. We formulate the 
GUE case and make a couple of remarks afterwards. 

Lemma 2.1 (Eigenvalue distribution for GUE). Let M be a random ma- 
trix from GUE. Then the distribution of the vector A — (Ai, . . . , Aat) of eigen- 
values of M has the density 

-Pjv(x) = — Aw(a:)2 TTe-"', x = (xi, . . . , xjv), (2.3) 

r=i 

with Zn the appropriate normalizing constant on the Weyl chamber Wn in 

C3. 

Sketch of the proof. Choose a (random) unitary matrix U which diagonalizes 
M, i.e., the matrix D — UMU~^ is the diagonal matrix with the eigenvalues 
on the main diagonal. Hence, 

dM = d{U*DU) ^ AU* ■ D - U + U* ■ AD - U + U* ■ D ■ dU 

^U* ■ (AD + U -AU* ■ D + D-AU -U*) -U , , 

^ ' (2.4) 

= AD ■ AU* ■ D ^ D ■ AU - U* 

= AD + AA-D-D-AA, 

where we used the invariance of AM under unitary conjugations, and wc intro- 
duced AA = U ■ AU* = —dU ■ U* . Now integrate over dM^.j with i < j and use 
calculus. See ICTe?m Ch. 3] or IHPOOl Ch. 4] for details. □ 

Remark 2.2. (i) We chose the normalization Zn such that Pn is normalized 
on Wn — {x e : xi < X2 < ■ ■ ■ < xn}- We extend P/v to a permutation 
symmetric function on M^. Hence, Pn — -^Pn is a probability density 
on R^. 

(ii) The density in (|2.3|) is called the Hermite ensemble. This is one of the 
most prominent examples of an orthogonal polynomial ensemble; the name 
refers to the Hermite polynomials which form an orthonormal base with 
respect to Gaussian weights. 

(iii) For the GOE and the GSE, there are analogous formulas. Indeed, replace 
Am{xY by AAr(a;)'^ with (3=1 respectively /3 = 4 to obtain the corre- 
sponding statement for GOE, respectively for GSE |Me91l Ch. 3]. The 
three matrix classes are usually marked by the parameter 



/3 = 1 for GOE, 



13 = 2 for GUE, 



/? = 4 for GSE. 



(2.5) 
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(iv) It is easy to extend Lemma l2. II to unitary- invariant matrix distributions. 
Indeed, if the distribution of M is of the form l|2.1|l with F{M) = 
/(Ai, . . . , Aat), a symmetric function that depends only on the spectrum of 
M, then the density of (Ai, . . . , \n) is proportional to a; i-^ e^^^^^ A^ix)"^ . 
An analogous assertion is true for the orthogonal case, see [HPOOl Ch. 4]. 

(v) If M = A* A is a Wishart matrix, i.e., A is an {N x A:)-matrix (with k < N) 
having throughout independent complex standard normal entries, then the 
vector of eigenvalues of M has the density |Ja64| 

1 ^ 

x^- Ajv(x)2rr[xf"'=e-"'], M^Arn(0,oo)^. (2.6) 

'^N.k , 

This ensemble is called the Laguerre ensemble. 

(vi) Using Selberg's integral |HPOOI p. 118/9], the normalizing constants of the 
Hermite ensemble and the Laguerre ensemble may be identified in terms 
of the Gamma-function. Indeed, for any /3 > 0, we have 

and, for any a > 0, 

/ \^N{x)f[[[x^ e 'Jdx=_[J_ p. • (2.8) 

i=l j=0 U + 2 / 

(vii) There is obviously a mutually repelling force between the eigenvalues in 
(|2.3|l : the density vanishes if any two of the N arguments approach each 
other. It does not seem easy to derive an intuitive reason for this repellence 
from random matrix considerations, but if the matrix M is embedded 
in a natural process of random Hermitian matrices, then the process of 
eigenvalues admits a nice identification that makes the repellence rather 
natural. This is the subject of Section ETI below. 



2.4 Circular ensembles 

An important type of random Gaussian matrices are the circular ensembles, 
which were introduced in |Dy62a| in the desire to define a matrix model that 
can be seen as the conditional Gaussian ensembles given a fixed value of the 
exponential weight F(M) in (|2.1|l . Again, there is an orthogonal, unitary and 
symplectic version of the circular ensemble. 

We give the definition of the circular ensembles |Me91l Ch. 9]. The circular 
orthogonal ensemble (COE) is the unique distribution on the set of orthogo- 
nal symmetric {N x A^)-matrices that is invariant under conjugation with any 
real orthogonal matrix. That is, an orthogonal symmetric random matrix S is 
COE-distributed if and only if WSW~^ has the same distribution as S, for any 
real orthogonal matrix W . The circular unitary ensemble ( CUE) is the unique 
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distribution on the set of complex unitary (N x iV)-matrices that is invariant 
under (two-sided) transformations with unitary matrices, i.e., a complex unitary 
random matrix S is CUE-distributed if and only if USV has the same distribu- 
tion as S, for any two unitary matrices U and V. Finally, the circular symplectic 
ensemble ( CSE) is the unique distribution on the set of self-dual unitary quater- 
nion matrices that is invariant under every automorphism S i— *■ H^^S'VF, where 
W is any unitary quaternion matrix and its dual. 

All eigenvalues of the circular matrices lie on the unit circle and may be 
written = e' with Q < Oi < 62 < ■ ■ ■ < On < 27r. One advantage of 
the circular ensembles is that the joint distribution density of their eigenvalues 
admits particularly simple formulas. Indeed, adopting the parameter (3 = 1,2,4 
for the COE, CUE and CSE, respectively (recall H2.5|l ). the density of the vector 
{9i, . . . ,9n) of eigenvalue angles is given as 



(2.9) 

Here we chose the normalization such that P^""''^^ is a probability density on 
Wn n [0, 27r)^ where Wn ^ {x £ : xi <■■■< xn} is the Weyl chamber. 

2.5 The law of large numbers: Wigner's semi-circle law 

In this section we present the famous semi-circle law first proved by Wigner: the 
convergence of the mean eigenvalue density as the size of the matrix increases to 
infinity. This is an asymptotic statement about the convergence of the empirical 
measure"* of the appropriately scaled eigenvalues of a random matrix towards 
the distribution 

^.iy^ll[_^,^,(x), (2.10) 

the famous semicircle distribution. We first formulate the semicircle law for the 
CUE, make some remarks and sketch two proofs. Afterwards we summarize 
some extensions. 

Theorem 2.3 (Semicircle law). Let the random matrix be a GUE-matrix 
of size N, with eigenvalues A^"' < • • • < A^'. Let 



N 

i 

fJ-N 



1 ^ 

wher^Xr^^N-ix^\ (2.11) 

i=l 

be the empirical measure of the rescaled eigenvalues. Then fij^ converges weakly 
in distribution towards the semicircle distribution 11^ in l|2.10|l . 

We shall call A^"', . . . , A^' the (unsealed) eigenvalues and A'^"', . . . , A^' the 
(re)scaled eigenvalues. 

^By the empirical measure of N points xi, . . . ,xpf we denote the probabiUty measure 
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Remark 2.4. (i) Theorem 12.31 reveals that the eigenvalues are of order ^/N 
and that the largest behaves like ^/2N . In particular, all eigenvalues lie 
in the interval 7V^/^[— — e, \/2 + e] for any e > with overwhelming 
probability, and the spacings between subsequent eigenvalues are of order 
7V~i/2 in the bulk of the spectrum and much larger close to the edge. 

(ii) The convergence takes place in the sense that the expectation of every 
bounded and continuous function of the empirical measure converges. Note 
that the moments (i.e., the family of maps /i ^ j x*' ii{dx) for fc e N) 
constitute a convergence determining family. 

(iii) Note that, for any a < b, 

E[^iN{[a,b])] =1e[#{*: A, €[a,6]}]. 

In particular, the semicircle law states that the expected number of un- 
sealed eigenvalues A-"' in the interval [aN^ , bN^] behaves like 7V^*([a, b]). 
See Remark 12. 5f vi) for further asymptotic statements. 

(iv) The convergence in Theorem 12.31 has also been proved |Ar67| in distribu- 
tion in the almost sure sense, see |HPOOI Th. 4.1.5]. More precisely, let 
(Mj j)j be a sequence of independent standard complex normal ran- 
dom variables and denote by = {Mi j)i j<:f^ the {N x iV)-corner. Let 
UN (as in H2. 111) 1 denote the empirical measure of the rescaled eigenval- 
ues of A/jv. Then all the fc-th moments of fj,N converge towards the fc-th 
moment of /x* . 

(v) See |HPOOI Ch. 4] for the statement analogous to Theorem 12.31 for the 
orthogonal ensembles. O 

We turn now to sketchs of two proofs. 
Sketch of the first proof: the method of moments. This is Wigner's 
original method [WiSS] . |Wi58| . see jHPOOL Ch. 4]. The idea is that it suffices 
to prove that the expected moments oi converge to the ones of /i*, i.e.. 



lim E 

TV— too 



x'' Hn{'^x) = / a:V*(dx), fc G N. (2.12) 



By symmetry, all odd moments of both /iat and /i* are zero, hence it suffices to 
consider k = 2m. The (2m)-th moments of are known to be In — (^™). Note 
that the left hand side is equal to the normalized trace of Mfj^, i.e.. 



N „ , N 



E 



/ x^'"M^(d.)] = ^Ee[/ -'"^(«)(d-)] - ]^e[E(A[ 

N _2m 



V 2m 



E[Tr(Mr)]=^ E E[nAV... 



il,---,i2m = l i = l 



(2.13) 

where Mi^k denote the entries of the matrix M^y. Some combinatorial work has 
to be done in order to discard from the sum those terms that do not contribute, 
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and to extract the leading terms, using the independence of the matrix entries 
and rough bounds on the moments of the matrix entries. The term coming from 
the subsum over those multi-indices zi, . . . , with . . . , i2m} < m + 1 is 

shown to vanish asymptotically, and the one with #{zi, . . . ,«2m} > m + 1 is 
shown to be equal to zero. □ 

The second proof is in the spirit of statistical mechanics and is based on the 
eigenvalue density in 1)2. Indeed, the convergence is derived with the help of 
large-deviation type arguments and the minimization of a certain energy func- 
tional. In particular, the semicircle law turns out to be the unique minimizer, 
because of which it is sometimes called an equilibrium measure for that func- 
tional. We partially follow the presentation in |De98[ Ch. 6], which is based on 
| Jo98| and |DMK98| . A general reference for equilibrium measures and related 
material is |ST97| . 

Sketch of the second proof: the equilibrium measure method. The start- 
ing point is the observation that the joint density P/v of the unsealed eigenvalues 
in H2.3|l is of the form PNix) — with the Hamiltonian 

N 

i7^(x) = ^a;2-2 ^ \og{x, - x,). (2.14) 

1=1 l<i<j<N 

In order to obtain a non-degenerate limit law, we have to rescale the A*"' in 
such a way that both parts of Hn{x) are of the same order in N. Since the 
second part is always of order N'^, it is clear that we should consider the scaling 
A-"' = A^~2 A^"' as in the theorem. The vector A*™' of the rescaled quantities 
has the density 

P(A<") e dx) = J-e-^'-f^"(") da;, (2.15) 

where 

1 ^ 2 

^^(^) = ]^E^^'^]^ E ^og{x,-x,), (2.16) 

1=1 l<i<j<N 

and we absorbed some terms in the new normalizing constant. In terms of the 
empirical measure of the rescaled quantities, /xjv, the Hamiltonian takes the 
shape Hn ~ 2{iim), where 

^ [ x^ ^i{dx) - [ [ log\x-y\fl{dx)^l{dy), ^ S Xi(R). (2.17) 

Here we suppressed the diagonal terms, i.e., the summands for i — j, which is a 
technical issue. Since the integration is only of the order TV and the exponent of 
order N"^, it is clear that the large- behavior of the measure J-e~^ j^j; 

is determined by the minimizer(s) of the variational problem 



E= inf I(rt). 



(2.18) 
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The minimizer(s) are called equilibrium measure(s). According to (a high-dimen- 
sional variant of) the well-known Laplace method, the value of E should be the 
large- iV exponential rate of Pn{x) with speed iV^, and the empirical measures 
/zjv should converge towards the minimizer(s). 

The analysis of the problem in (|2.18|l is not particularly difficult. Using stan- 
dard methods one shows the existence and uniqueness of the equilibrium mea- 
sure and the compactness of its support. Using the Euler-Lagrange equation 
in the interior of its support, one identifies the equilibrium measure with the 
semicircle law, However, in order to show the convergence of /i^r towards /z*, 
one needs to show that the contribution coming from outside a neighborhood 
of /i* is negligible, which is a more difficult issue. This is carried out in j,To98| . 

□ 

Remark 2.5. (i) The moment method has been generalized to a large class 
of Wigner matrices, [HPOOI Ch. 4]. Indeed, assume that the real and the 
imaginary parts of the sub-diagonal entries of Mjv are independent, cen- 
tred and have variance one and that the diagonal entries have variance 
two, and assume that, for any G N, their fc-th moments are uniformly 
bounded in N , then the conclusion of Theorem 12 . 31 remains true. 

(ii) The equilibrium measure method has been generalized to a large class 
of unitary-invariant matrices in |lD99j and |DMK98j . see also |De98] . To 
mention one of the most obvious generalisations, let Mjv be a matrix as 
in H2.1|l with F as in (|2.2|) . i.e., the eigenvalues have the density in (|2.3|l 
with the term xj replaced by the polynomial Q{xi) = 72^2^^"' + 0(x^^^ ^); 
recall Remark l^fiv). The correct scaling is A^"' = N^"^ \\'^\ and in the 
limit as — > 00, only the leading term of Q{xi) survives. The empirical 
measure of the A^"' converges weakly towards the equilibrium measure of 
the functional 

jj,^ I -^2]X^^ ^i{dx) ~ i i log|x-y|/i(da;)^(d?/). (2.19) 

JR JR JR 

The analysis of this functional and the proof of convergence towards its 
minimizer is similar to the proof in the special case where Q{x) = x^ . 
The equilibrium measure has a density, and its support is compact. If t\} 
denotes the density and [—a, a] its support, then i/)(a;) — [c? ~ x'^)'^h\{x) 
for |a;| < a, where h\ is a polynomial of order 2j — 2. 

(iii) Even more generally, one starts immediately from distributions as in (|2.3|l 
with x\ replaced by NV{xi) (note the factor of N) with some sufficiently 
regular function V tending to infinity at infinity sufficiently fast. With this 
ansatz, no rescaling is necessary, i.e., the empirical measure of the unsealed 
vector (A*j^"\ . . . , A^^') converges. The relevant functional is then the one 
in H2.19|l with ^ijX^^ replaced by Vix). The Euler-Lagrange equations for 
this functional are, for some / € M, 

2 / log|x-yrV*(dy) + l^(x) a; e supp(/x,)°. (2.20) 

Jr 
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However, for general V , the explicit identification of the minimizer is con- 
siderably more difficult and involved. In general, if V is convex, then the 
support of the equilibrium measure is still an interval, but in the general 
case it consists of a finite union of intervals, provided that V is analytic 



(iv) The energy functional 2 in H2.17|l has an interpretation in terms of an 
electrostatic repulsion in the presence of an external quadratic field, if 
/Lt is the distribution of electrons. The second term is sometimes called 
logarithmic entropy or Voiculescu's entropy, see |Vo93| and |Vo94| . 

(v) An advantage of the equilibrium measure method is that it opens up 
the possibility of a large-deviation principle for the empirical measure of 
the rescaled eigenvalues. (This is, roughly speaking, the determination of 
the large- iV decay rate of the probability for a deviation of the empirical 
measure from the semicircle law in terms of a variational problem involving 
the energy functional.) The first proof of such a principle is in )BA(t97| . 
after pioneering (and less rigorous) work in |Vo93| and Vo94]. Extensive 
and accessible lecture notes on large deviation techniques for large random 
matrices may be found in |Gui04| . 

(vi) In the course of the equilibrium-measure proof of Theorem l2.3l (see |De98l 
Theorem 6.96]), for every fc e N, also the weak convergence of the k- 
dimensional marginal of Pjv with density 



towards the fc-fold product measure /if''^ is proved. As an elementary con- 
sequence, iV~'^ times the expected number of /c-vectors of different rescaled 
eigenvalues in [a,b] converges towards ii^,{[a,b])'' . 
(vii) There is an analogue of the semicircle law for the spectrum of the cir- 
cular ensembles introduced in Section 12.41 without normalisation of the 
eigenvalues required. An innovative technique for deriving this law was in- 
troduced in DS94) (see also DEOl,), where the asymptotic independence 
and normality of the traces of powers of the random matrix under con- 
sideration is shown. Related results are derived in |DS94| for the problem 
of the longest increasing subsequence of a uniform random permutation, 
which is introduced in Section 13.51 ^ 

2.6 Correlation functions 

In this section wc let Pjv : Wn [0, oo) be any probability density on the Weyl 
chamber Wn in (|1.2|) and A = (Ai, . . . , Ajv) € Wn be a random variable with 
density Pn. We introduce the so-called correlation functions of Pn, which will 
turn out to be important for two reasons: (1) much interesting information about 
the random variable A can be expressed in terms of the correlation functions, and 
(2) when specializing P/v to an orthogonal polynomial ensemble, the correlation 
functions admit a determinantal representation which will be fundamental for 
the asymptotic analysis of the ensemble. 



|l>MK98j. 




(2.21) 
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We extend P/v to a permutation invariant function on . Then Pjv W.^^ 
is a probability density on R^. For k E N, the k -point correlation function is 
defined as 

N Nl f ^ 

(N-ky.J^N-k 

(2.22) 

Then Pjy^k — ■^^^^r^^fc"' is a probability density on W', the marginal density of 
P/v in H2.21|l . It is a simple combinatorial exercise to see that, for any measurable 
set A C M, the quantity /^^ i?^"'(x) d'^x is equal to the expected number of k- 
tupels (Ail T ■ ■ T^ik) of distinct particles such that Xi. g ^ for all j = 1, . . . , /c. 
In particular, Rl^^x) dx is the expected number of particles in da;. 

As a first important application, the probability that a given number of par- 
ticles lie in a given set can be expressed in terms of the correlation functions as 
follows. 

Lemma 2.6. For any N (z N, any m e {0, 1, . . . , N} and any interval / C M, 

< iV: A. e /} = m) = ^-^ ^ / P<"'(x) d'=x. (2.23) 

^ ' m! '^-^ (k — my. Jjk 

k—m 

Sketch of the proof. We only treat the case m = 0, the general case being a 
simple extension. Expand 

N N 
i=l fc=0 

where the functions Cfe are permutation symmetric polynomials, which are de- 
fined by the relation Oil^il-^: ~ Q^i) = J2k=oi^^)'^^'^-^''(k{a) for any z € R and 
a = (ai, . . . ,ajv) € R^. Now multiply by the density Pjv and integrate over 
M.^ . Using the explicitly known coefficients of the polynomials Cfe, and using the 
permutation invariance of P^"', one arrives at (|2.23|l . □ 

Also the expected number of spacings in the vector A can be expressed in 
terms of the correlation functions. For x — {xi, . . . ,xn) S Wn, m G K and 
s,t>0 denote by 

S^"\s;x) = #[j e {!,..., N~ I}: x,+,-Xj<s}, (2.24) 
{s,u;x) = #{je{l,...,N-l}:xj+i~Xj<s,\xj-u\<tY (2.25) 



(N) 
t 



the number of nearest-neighbor spacings in the sequence xi^ . . . ,xn below the 
threshold s, respectively the number of these spacings between those of the 
Xi, . . . ^xn that are located in the interval with diameter 2t around u. Clearly, 
S'-"^[s;x) = hmf^oo S'j"'(s,u;a;). It is convenient to extend 5''"'(s;-) and 
S["\s, u; •) to permutation invariant functions on R^. 
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Lemma 2.7. For any N eN and t, s > 0, and it G M, 

E[sr(s,u;X)] 

Yl 71 T\T / / ^r'(''' + 2/2, r + 2/3, . . . , r + T/fe) dy2 • ■ ■ d?/fc 

(ft - ij! J[o,si''-i 



(2.26) 



Sketch of the proof. We do this only for t — oo. For k > 2 and y 
(i/i,...,2/fc) e K^ let 

■^M ) = H Xfc,s (%i , • ■ ■ , ) • 

l<Ji<-<Jfc<JV 



Elementary combinatorial considerations show that ^'"'(s;^) = 
Sf=2(~l)'°^fc"'(^) foi" ^ *^ ^A'- Multiplying this with the density Pat, 
integrating over Wn and using the permutation symmetry of Pjv — -^Pn and 
yields 

JV 

E[5<"'(s; A)] = V(-l)'= / Hxk -xi< s}^r(^) d'a;. (2.27) 



Now an obvious change of variables and the symmetry of i?^"' yields the asser- 
tion for t = oo. □ 



2.7 The orthogonal polynomial method 

In this section we briefly describe the most fruitful and most commonly used 
ansatz for the deeper investigation of the density P/v in (|2.3() : the method of or- 
thogonal polynomials. This technique has been first applied to random matrices 
by Mehta |Me60| but relies on much older research. For the general theory of 
orthogonal polynomials see |Sz75j . We follow jDe98l Sect. 5] and treat a general 
orthogonal polynomial ensemble of the form 

1 ^ 

PN{x) = ——ANixfT\e-Q^^^\ 2:=(xi,...,x^)GM^, (2.28) 

2—1 

where Q : M M is continuous and so large at infinity that all moments of the 
measure e"'^^^^ dx are finite. We normalized Pn to a probability density on R^. 

Let {iTj)j^fig with Trj{x) = x^ + bj-ix^~^ + • • • + bix + bo be the sequence of 
orthogonal polynomials for the measure e""^*^^-* dx, i.e., 

j iT.,{x)Tij{x)e^^^'''^ dx = c,Cj5^j, iJeNo. (2.29) 
Jm 
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(In the GUE-case Q{x) = x'^, these are the well-known Hermite polynomials, 
scaled such that the leading coefficients are one.) Elementary linear manipula- 
tions show that the Vandermonde determinant in (|1.3|l can be expressed in terms 
of the same determinant with the monomials replaced by the polynomials 
TTjix), i.e., 

AAr(a;) =det[(7rj_i(a;,)),j=i,...,Ar], xeR^. (2.30) 

We normalize the ttj now to obtain an orthonormal system i(l}j)jeth of i^(K) 
with respect to the Lebesgue measure: the functions 

^^{x) = le-«(^)/27rj(a;) (2.31) 

satisfy 

(x) (/)-,■ (x) da; = (5y, iJeNo. (2.32) 
An important role is played by the kernel Kn defined by 

KN{x,y) = ^A^)^M^ x,yeR. (2.33) 
i=o 

The fc-point correlation function in (|2.22l) admits the following fundamental 
determinantal representation. 

Lemma 2.8. Fix TV G N and x G M^, then, for any k e {I,.. ., N}, 

R'Pixi, ...,Xk)= Aet[{KM{x„Xj)),,,=i....,k\ ■ (2.34) 

In particular, 

R''i\xi) = Kn{xi,xi) and Pn{x) = dct[(iirAr(xi, a;j))jj=i,...,Ar] • 

(2.35) 

Sketch of the proof. Using the determinant multiplication theorem, is easily 
seen that the density Pn may be written in terms of the functions 4>j as 



2 ^2 



CnC 



Pn{x) = TTr= det[((?!)j_i(x,))ij=i,...,jv] 

^ ^'^^ (2.36) 

= det[(A'Ar(a;i, a;j))ij=i,...,7v] , 
Zn 

where Zn — ^i-Z^w HiliO^ Using the special structure of this kernel and 
some elegant but elementary integration method (see jUe98l Lemma 5.27]), one 
sees that the structure of the density is preserved under successive integration 
over the coordinates, i.e.. 



det[(ii'Ar(xi,a;j))ij=i,...,Ar] da;fe+ida;fc+2 • ■ - (^xn 



= [N - k)\dei[{KN{xi,Xj))i.j=i,...,k\, l<k<N. 

(2.37) 
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In particular, Zn = N\, and l|2.34(l holds. □ 

Remark 2.9 (Determinantal processes). Lemma [2.81 offers an important 
opportunity for far-reaching generalisations. One calls a point process (i.e., a 
locally finite collection of random points on the real line) a determinantal process 
if its correlation functions are given in the form H2.34|l . where K is, for some 
measure /i on R having all moments, the kernel of a nonnegative and locally trace 
class integral operator L^(R, /i) L^(IR,/i). Because of jDe98l Lemma 5.27], 
correlation functions that are built according to (|2.34(l form a consistent family 
of A''-particle distributions and therefore determine a point process on R. To a 
certain extent, random matrix calculations only depend on the determinantal 
structure of the correlation functions are may be used as a starting point for 
generalisations. O 

Now let A — (Ai, . . . , Aat) E Wn be a random variable with density Pn = 
NlPpf. We now express the probability that a given interval / contains a certain 
number of A^'s in terms of the operator ICn on L^{I) with kernel Kjqix, y). 

Lemma 2.10. For any N e N, any m e {0, . . . , iV}, and any interval I C M, 
P(#{* < N: A,; e /} = m) = ^^(^) Aet[{ld- ^K.n)\l-(i)] (2.38) 



7=1 



where Id denotes the identical operator in L^{I). 
Sketch of the proof. From Lemma [2 .61 and H2.34|l we have 
P(#{i < A^: A, e /} = m) 



= ^ jhU^. y , det[(i^^(x.,x,)),,=r,...,.] d'x. 

k—ra / I ^ 

On the other hand, for any 7 S M, by a classical formula for trace class operators, 
see |H.S758()I Vol. IV, Sect. 17] 

det[(Id-7/Cjv)|L2(^)] =^ / det[(i^Ar(a;„x,)),,,=i,...,fe]d*=x. (2.40) 

fc=o ' 

Now differentiate m times with respect to 7 and put 7 = 1 to arrive at 



□ 



2.8 Spacings in the bulk of the spectrum, and the sine kernel 

In this section, we explain the limiting spacing statistics in the bulk of the 
spectrum of a random unitary-invariant {N x N) matrix in the limit TV — > 00. We 
specialize to the matrix distribution in (|2.1() with F as in H2.2|l and Q{x) = x^^ 
for some j € N. This has the technical advantage of a perfect-scaling property 
of the eigenvalues: as was pointed out in Remark I2.5f iil. the correct rescaling 
of the eigenvalues is A-"' — X'^K In order to ease the notation, we shall 
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consider A'"' instead of A*"'. Note that the distribution of A<"' is the orthogonal 

polynomial ensemble in H2.28|l with Q{x) = Nx'^^ , and we shall stick to that 

choice of Q from now. 

Let i/j: K ^ [0, cxd) denote the density of the equilibrium measure (i.e., the 

unique minimizer) for the functional in (|2.19() with j2j = 1- According to the 

semicircle law, the rescaled eigenvalues A^"' lie asymptotically in the support 

of Tp, which is the rescaled bulk of the spectrum. In particular, the spacings 

between neighboring rescaled eigenvalues should be of order , and hence the 

J. 1 

spacings of the unsealed eigenvalues are of order N^^ 

We fix a threshold s > and a point u G supp('i/;)° in the bulk of the 
rescaled spectrum and want to describe the number of spacings < of the 
rescaled eigenvalues in a vicinity of u. Let {tN)Nef^ be a sequence in (0,oo) 
with ^Tv^OasiV^oo. The main object of our interest is the expected value 
of S'(^'(-^,m; a*"'), the number of spacings < in the sequence A*"' in a, t^- 
interval around u; see (|2.25|l . We expect that this number is comparable to t^^N, 
and we want to find the asymptotic dependence on s and u. 

We continue to follow |De98l Sect. 5] and stay in the framework of Section lTTl 
keeping all assumptions and all notation, and specializing to Q{x) = Nx'^K We 
indicate the A^-dependence of the weight function Q{x) = Nx"^^ by writing K''^^ 
for the kernel Km defined in H2.33|l and H2.31|l . Abbreviate 

nN{,u)=K'-^\u,u). (2.41) 

We write i?^"' for the 1-point correlation function with respect to the ensemble 
in H2.28|l with Q{x) = Nx"^^; hence R["\u)du is the density of -i- times the 
number of rescaled eigenvalues in du (see below (|2.22|l '). From (|2.35|l we have 
kn(u) = R'i'\u). Hence, the asymptotics of kn{u) can be guessed from the 
semi-circle law: we should have kn{u) = R'['''{u) « N'tlj{u). (In the GUE-case 
j — 1, we have \u\ < -\/2 and kn{u) « N^\/2 — v?.) We shall adapt the scaling 
of the expected number of spacings to the spot u where they are registered by 
using the scaling — W instead of 4?. This will turn out to make the value of 
the scaling limit independent of u. 

We use now Lemmas 12 . 71 and 12.81 and an elementary change of the integration 
variables to find the expectation of the number of rescaled eigenvalue spacings 
as follows. 



-E 



Kjv(u) 2tN 

det 

[0,s]'=-i 



V KAffuV Km(u)J 



t \"N \' TT'' ' I I dy2...dyk- 

KNyU) V Kn{u) Kn{U) / / i,j=l,...,ki j/i=0 

(2.42) 

Hence, we need the convergence of the rescaled kernel in the determinant on the 
right hand side. This is provided in the following theorem. The well-known sine 
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kernel is defined by 



§(x - y) 



sin(7r(a; — y)) 
7t{x - y) 



sin(7rx) sin'(7rj/) — sin'(7ra;) sm{'Ky) 



(2.43) 



7r(x - y) 



Proposition 2.11 (Bulk asymptotics for K^^^). Fix j e N and Q{x) = 
Nx"^'^ . Let K'j^^ be as in H2.33|l with the functions (pj defined in (|2.31|l such that 
H2.32|l holds. Denote hy^jj: M — > [0,oo) the equilibrium measure of the functional 
in (|2.19|) withj2j = 1- For u G supp(?A)°, abbreviate kj^{u) — K'j^' {u,u). Then, 
uniformly on compact subsets in u G supp(?/')° and x,y gM., 



For a rough outline of the proof using Riemann-Hilbert theory, see Sec- 
tion EUDl below. 

Remark 2.12. (i) The asymptotics in (|2.44|) in the GUE case j — 1, where 
the orthogonal polynomials are the Hermite polynomials, are called the 
Plancherel-Rotach asymptotics. 

(ii) Note that the limit in 12.4411 is independent of u, as long as u is in the inte- 
rior of the support of the equilibrium measure, i.e., as long as we consider 
the bulk of the spectrum. See Proposition 12 . 1 51 for the edge asymptotics. 

(iii) The asymptotics in Proposition l2. 1 ll are universal in the sense that they do 
not depend on the weight function Q{x), at least within the class Q{x) = 
x'^^ we consider here (after rescaling). The case of a polynomial Q{x) = 
2;2j _|_ 0(2;2j"-i-j jg asymptotically the same, but the proof is technically 
more involved. 

(iv) The proof of PropositionEniis in jDe98l Ch. 8], based on |KS99j . The first 
proof, even for more general functions Q, is in |PS97| . See also |D99| and 
|BI99| for related results. The main tool for deriving l|2.44|l (and many 
asymptotic assertions about orthogonal polynomials) are the Riemann- 
Hilbert theory and the Deift-Zhou steepest decent method. 

(v) Analogous results for weight functions of Laguerre type (recall (|2.t)|l 'l for 
(3 = 2 have been derived using adaptations of the methods mentioned in 
(iv). The best available result seems to be in |Va05j . where weight functions 
of the form fj,{dxi) = xfe~^'^^^^ dxi are considered with a > —1, and Q is 
an even polynomial with positive leading coefficient. The cases /3 = 1 and 
/3 = 4 are considered in ' DGKVOSl . 

(vi) The orthogonal and symplectic cases (i.e., /? = 1 and (3 = 4) for Hcrmite- 
type weight functions ^{dxi) = e""^^^'^ dxi with Q a polynomial have also 
been carried out recently |D(T05a| . 

(vii) Using the well-known Christoffel-Darboux formula 




(2.44) 




(2.45) 
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(where qj = iTj/cj] see (|2.31|l 'l. one can express the kernel Kn defined in 
(|2.33l) in terms of just two of the orthogonal polynomials. Note the formal 
analogy between the right hand sides of H2.45|l and H2.43() . O 

Now we formulate the main assertion about the limiting eigenvalue spacing 
for random unitary-invariant matrices. Denote by /Csin the integral operator 
whose kernel is the sine kernel in H2.43|l . 

Theorem 2.13 (Limiting eigenvalue spacing). Let Mjv he a random unitary- 
invariant matrix of the form (I2.1fl with F as in (12. 2() and Q{x) = x^^ for some 
j € N. Let A*"' = A^~^(A^"', . . . , A^') be the vector of scaled eigenvalues of 
M]y. Denote by ip'^ ^ [0,oo) the equilibrium measure of the functional in 
H2.19|l with "f2j = 1- Fix u G supp('i/')° and s > and let {tN)N&i be a sequence 
in (0, oo) with ijv — > 0. Recall the definition H2.25|l of the spacing number. Then 



lim — ^ -^E 



d^ 



N^^oc. NiPiu) 2tN L *" KNil^iuY 

where 

p{v) = ^dei[{ld-JCsin)\L-i[v,oo))]. v>Q, (2.47) 
is the density of the Gaudin distribution. 

Sketch of the proof. In l|2.42|l , replace the normalized r-integral by the integral 
over the delta- measure on u and use Proposition 12 . 1 ll to obtain 

left hand side of 

°° (—1)'^ f (2 48) 

= 1^ (fc- IV / _ det[(§(y, -yj)),j=i,...,fc]|^^^pd2/2---dyfc. 

On the other hand, note that 



j p{v) dv = 1 + — det[(Id - ICsin)\L^[s,oo))] 



de 



^ ' det[(Id - ICsin)\m[e.s])] 

6 = 



l + E^r /- .. det[(S(y.~2/,)).,,.i,....fc]d'=y 
fc=i 



=0 

= right hand side of 

(2.49) 

as an application of the product differentiation rule shows. □ 

Remark 2.14. (i) It is instructive to compare the asymptotic spacing distri- 
bution of the rescaled eigenvalues of a large random matrix (which have a 
mutual repellence) to the one of N independently on the interval [0, 1] ran- 
domly and uniformly distributed points (where no interaction appears). 
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The latter can be reaHzed as a standard conditional Poisson process, given 
that there are precisely N Poisson points in [0, 1]. The asymptotic spacing 
density for the latter is just v '—^ e"", and the former is t; i— > p{v) as in 
Theorem 12.131 Note that the asymptotics of p{v) for w | and the one 
for t; ^ oo are both smaller than the one of e^^. Indeed, it is known that 
p{v) « w'^ as w I if in H2.3|l the term An{x)^ is replaced by An{x)^ and, 
furthermore, p{v) w e~" as w ^ oo; see IDIZ97| and |De98[ Sect. 8.2]. 
(ii) Another variant of the assertion in H2.46|l is about the number of pairs of 
rescaled, not necessarily neighboring, eigenvalues whose difference is in a 
fixed interval (a, 6): 



lim — E 

N^oo N 
b , 



#{(*,j)e{l,...,7V}2:a<Ar'-Af' <5} 



1 



/sin(7rii) \ 

V TTU J 



du 



'l ifOe(a,6), ^'^■^^^ 
otherwise. 



The last term accounts for the pairs i — j. 

(iii) Proposition l2.11l and Theorem l2 . 1 31 are extended to a large class of Wigner 
matrices in more precisely to the class of random Hermitian ma- 
trices of the form W + aV, where is a Wigner matrix as in Section 

a > and ^ is a standard GUE-matrix. The entries of W are not assumed 
to have a symmetric distribution, but the expected value is supposed to 
be zero, the variance is fixed, and the (6 + £)-th moments for any e > 
are supposed to be uniformly bounded. This result shows universality of 
the limiting spacing distribution in a large class of Wigner matrices. The 
identification of the distribution of the eigenvalues of + aV uses the 
interpretation of the eigenvalue process of {W + aV)a>o as a process of 
non-colliding Brownian motions as in |Dy62b| , see Section EITl below. 

(iv) After appropriate asymptotic centering and normalization, the distribu- 
tion of the individual eigenvalues for GUE in the bulk of the spectrum is 
asymptotically Gaussian. Indeed, for = {a+o{l))N with a G (— \/2, V^) 
(i.e., a is in the interior of the support of the semicircle law /i* in H2.10|l '). 
the correct scaling of the z^v-th eigenvalue is 

A<"' - tV2N 



( log iV V 

v(i-2t^)jvy 

where t is determined by /i* ((— cxd, t]) = a. One main result of |Gus04| 
is that -^i^' is asymptotically standard normal as — *■ 00. Also joint 
distributions of several bulk eigenvalues in this scaling are considered in 
|Gus04j . In particular, it turns out that A*"' and A^"' are asymptotically 
independent if \i — j\ is of the order A^, but not if |« — j| = o{N). O 



2.9 The edge of the spectrum, and the Tracy- Widom distribution 

In this section we explain the limiting scaled distribution of the largest eigen- 
value, A^\ of an (A x A^) GUE-matrix, i.e., we speciaHze to j — 1. Let 
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A'"' = (A^"', . . . , A^') € Wn be the vector of the eigenvalues. According to 
Lemma 12.11 its distribution is the orthogonal polynomial ensemble in (|2.28|l 
with Q{x) = x^. Hence, the distribution of the vector of rescaled eigenvalues, 
^-i/2;^(iv)^ is that ensemble with Q{x) = Nx^. The event {A^' < t} is, for any 
i G M, identical to the event that no eigenvalue falls into the interval {t,(X)). 
Hence we may apply Lemma l2 . 1 01 for / = (t, oo) and m = 0. In order to obtain 
an interesting limit as — s- cx), we already know from the semicircle law that 
t should be chosen as t = ^/2N + 0{N") for some a < i. It will turn out that 
a = — g is the correct choice. 

As in the preceding section, we denote by K'^'' the kernel Kn defined in 
H2.33|l for the choice Q{x) = Nx'^, with the functions (pj defined in H2.31(l such 
that H2.32(l holds. Using Lemma 12.61 for m = and l|2.34(l . we see, after an 
elementary change of measure, that 

P(A<^' < V2N + siV2N^)-') 



N 



= E^/ det[f^i^i^rf^/2 + -^,^/2 + -^ 

(2.51) 

In order to obtain an interesting limit, one needs to show that the inte- 
grand on the right hand side of (|2.5HI converges. This is provided in the fol- 
lowing theorem. By Ai : R ^ R we denote the Airy function, the unique solu- 
tion to the differential equation f"{x) — xf{x) on R with asymptotics f{x) ~ 
{4:TT^/xy^'^e~i^^ as X — *■ oo. The corresponding kernel, the Airy kernel, is given 

by 

Km{x., y) = — — — — = / Ai(x+w)Ai y-t-u du, a;, y e R. 

x-y J a 

(2.52) 

Note the formal analogy to H2.43|l and H2.45|l . 

Proposition 2.15 (Edge asymptotics for K^). Uniformly in x,y Q R on 
compacts, 

lim —]-^K'"^ f V2 -t- -j^, V2 + -J— A = Km{x, y). (2.53) 

Remark 2.16. (i) Note that the kernel K'^^ scales with N~i at the edge 
of the spectrum, i.e., in ±\/2, while it scales with in the interior of the 
support of the equilibrium measure, (-V2, V^) (see Proposition Enj. 

(ii) The Airy kernel already appeared in BB91! in a related connection. Proofs 
of ProDOsition l2 . 1 5l were found independently by Tracy and Widom [TW93a] 
and Forrester |R)93 . 

(iii) For an extension of Proposition 12 . 1 5| to the weight function Q{x) = x"^^ 
for some j G N, see (De98. Sec. 7.6], e.g. The real and symplectic cases 
(i.e., P — \ and /3 = 4) have also been recently carried out |DG05b| . 



d x. 
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(iv) Analogous results for weight functions of Laguerre type (recall (|2.6|l and 
Remark Em v)) for /? = 1 and /3 = 4 are derived in |DGKV05| . Both 
boundaries, the 'hard' edge at zero and the 'soft ' one at the other end, 
are considered. O 

Next, we formulate the asymptotics for the edge of the spectrum, i.e., the 
largest (resp. smallest) eigenvalues. Let q: M ^ K. be the solution^ |HML80j of 
the Painlcve II differential equation 



q"{x) = xq{x)+2q{xy 



(2.54) 



with asymptotics q{x) ~ Ai(x) as x ^ oo. It is uniquely determined by requiring 
that q{x) > for any a; < 0, and it has asymptotics q{x) ~ y^\x\/2 as x — oo. 
Furthermore, ^'(a;) < for any x S M. 

Define a distribution function F2 : M ^ [0, 1] by 



-^2(5) = exp|- / {x - s)q'^{x)dx^, 



s e 



(2.55) 



This is the distribution of the by now famous GUE Tracy- Widom distribution; 
its importance is clear from the following. 

Theorem 2.17 (Limiting distribution of the largest eigenvalue, |TW94a| ) . 

Let Mjv be a random Hermitian matrix of size N from GUE, and let A^^' be 
the largest eigenvalue of Mn ■ Then, for any s G M, 



hm P((A^"> 



2N)V2N^^^ < s 



(2.56) 



Proof. Using H2.51|l and Proposition 12. 151 we obtain 



lim 



E 

k=0 



\(iV) 



fc! 



/2iV) V2N^/^ < s 



det 



iKAi{Xi,Xj))i^j^l^,,,^k 



d X 



= det^(Id- /CAi) 



(2.57) 



where /Cai is the operator on L^([s,oo)) with kernel Kai- The relation to the 
Painleve equation is derived in TW94a using a combination of techniques from 
operator theory and ordinary differential equations. □ 

Remark 2.18. (i) The great value of Theorem 12. 171 is the characterization 
of the limit on the left hand side of H2.57|l in terms of some basic ordinary 
differential equation, in this case the Painleve II equation. Analogous re- 
lations between the Gaudin distribution p in (|2.47|l and the Painleve V 
equation were derived in |JMMS80| . 

^The function u = —q is also a solution of i2.54i . which is sometimes called the Hastings- 
Mac Leod solution. 
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(ii) There are analogous statements for GOE and GSE jTW96j . The hniiting 
distributions are modifications of the GUE Tracy- Widom distribution. 
Indeed, for (3 = 1 and (3 = 4, respectively (recall (|2.5|l ). F2 is replaced 
by 

= exp{-i [q{x) + ix- s)q\x)] dx} = /FR^e"^ T '^^^ 
F4 is) - /fM^ i [e^ /r + e"^ ^ '^^j . 

(2.58) 

(iii) The expectation of a random variable with distribution function F2 is 
negative and has the value of approximately —1.7711. 

(iv) In jT W94aj . also the joint distribution of the first m top eigenvalues is 
treated; they admit an analogous limit theorem. The starting point for 
the proof is Lemma 12.61 and H2.34|l . 

(v) Theorem 12. 171 is generalized to a large class of Wigner matrices in jSo99| . 
It is assumed there that the entries have a symmetric distribution with all 
moments finite such that the asymptotics for high moments are bounded 
by those of the Gaussian. The proof is a variant of the method of moments 
(see the first proof of Theorem 12.3(1 . The main point is that the expected 
trace of high powers (appropriately coupled with the matrix size) of the 
random matrix is bounded by a certain asymptotics, which is essentially 
the same as for GUE. Since the expected trace of high moments depends 
on the matrix entries only via the moments, which are the same within the 
class considered, the result then follows from a comparison to the known 
asymptotics for GUE. 

(vi) If the index i^r is a bit away from the edge N, then the iAr-th largest 
eigenvalue scales to some Gaussian law. Indeed, ii = N ~ with 

— > 00, but kif/N 0, then the correct scaling is 



X, 



and one main result of |Gus04| is that X^^'' is asymptotically standard 
normal. Also joint distributions of several eigenvalues in this scaling are 
considered in |Gus04| . In particular, it turns out that A,-^' and A*-^' (pro- 
vided that N — ijsf and TV — are o{N)) are asymptotically correlated if 
\iN - JnI ^ o{N - iN). O 



2.10 Some elements of Riemann-Hilbert theory 

Apparently, the most powerful technical tool for deriving limiting assertions 
about orthogonal polynomial ensembles is the Riemann-Hilbert (RH) theory. 
This theory dates back to the 19th century and was originally introduced for the 
study of monodromy questions in ordinary differential equations, and has been 
applied to a host of models in analysis. Applications to orthogonal polynomials 
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were first developed in |FIK90| . and this method was first combined with a 
steepest-decent method in |DZ93| . Since then, a lot of deep results on random 
matrix theory and related models could be established using a combination of 
the two methods. The reformulation in terms of RH theory leaves the intuition 
of orthogonal polynomial ensembles behind, but creates a new framework, in 
which a new intuition arises and new technical tools become applicable which 
are suitable to deal with the difficulties stemming from the great number of zeros 
of the polynomials. For a recent general survey on Riemann-Hilbert theory, see 
| It03j : for a thorough exposition of the adaptation and application of this theory 
to orthogonal polynomials, see the lectures (De98| . and Ku03 , DOl and |BI99| . 

In this section, we give a rough indication of how to use Riemann-Hilbert 
theory for scaling limits for orthogonal polynomials, in particular we outline 
some elements of the proof of Proposition 12.111 Wc follow IDe98J . Let us start 
with the definition of a Riemann-Hilbert problem in a situation specialized to 
our purposes, omitting all technical issues. 

Let E be a finite union of the images of smooth, oriented curves in C, and 
suppose there is a smooth function v (called the jump matrix) on E with values 
in the set of complex regular (2 x 2)-matrices. We say a matrix-valued function 
y on C \ E solves the Riemann-Hilbert (RH) problem (E,t!) if 

(i) Y is analytic in C \ E, 

(m) Y+{x) = Y^{x)v{x), 
(lit) Y{z) = / + 

where / is the (2 x 2)-identity matrix, and Kf(x) and Y-{x) arc the limiting 
boundary values of K in x € E coming from the positive and negative side of 
E, respectively.^ 

The main connection with orthogonal polynomials is in Proposition 12 . 1 9l be- 
low. Assume that ^{dx) = w{x) dx is a positive measure on R having all mo- 
ments and a sufficiently regular density w, and let (7r„)„gNo be the sequence of 
orthogonal polynomials for the L^-inner product with weight w, such that the 
degree of 7r„ is n and the highest coefficient one. Hence, for some fc„ > 0, 

Trn{x)njn{x) fi{dx) ^ ^Sn,m, n,m gNo- (2.60) 
Recall the Cauchy transform, 

Cf{z)= [ I^pr. zeC\RJeH\M.). (2.61) 
Jt^x - z 27ri 

Here we think of R as of an oriented curve from — oo to oo, parametrized by the 
identity map. Note that C(/)+ - C(/)_ = / on R. 

"The definition of Y^(x) and Y-{x) and the sense in which (ii) is to be understood have 
to be explained rigorously, and (ii) is required outside the intersections of the curves only, 
but we neglect these issues here. The general notion involves (fc X fe)-matrices for some fc G N 
instead of (2 X 2)-matrices. 



a; e E, (2.59) 

as z — > oo. 
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Proposition 2.19 (RH problem for orthogonal polynomials, jFIKQOj . 

| FIK9l | V Fix 71 g N and consider the jump matrix v{x) —(^ J ) for a; € R. 
Then 

\^-27ri fc,';„i7r„_i(z) -27ri fc^_;^7r„_i(z)C(7r„_iu;)(z) y ' ^ 

(2.62) 

is i/ie unique solution of the RH problem"^ 

(i) 

(m) r|"'(x) 

(m) r<"'(z) 

The main advantage of the characterisation of the orthogonal polynomials 
in terms of a RH problem is that it provides a technical frame in which the 
difficulties stemming from the oscillations of the polynomials close to their zeros 
can be resolved. 

Now we specialize to w{x) ~ e-^'3(^) with Q{x) ~ x^^ for some j e N, recall 
Remark l2.5f ii) and Section IT^ We now write tt^"' instead of 7r„ for the orthog- 
onal polynomials. We shall (extremely briefly) indicate how the asymptotics of 
the 7V-th orthogonal polynomial tt]^' can be deduced from RH theory, building 
on Proposition 12. 191 

The first main step is a transformation of (|2.63(l which absorbs the exponen- 
tial term of the jump matrix into an inverse exponential term in the solution of 
the new RH problem.^ For doing this, we need to use some information about the 
variational formula in (|2.19|) with 72 j = 1. Recall the Euler-Lagrange equations 
in (|2.20|l for the equilibrium measure /x*(dx) = 'ipix) dx, and put 

g{z)= / log(z - a;)V'(a;) dx, z e C \ M. (2.64) 
Js. 

The intuitive idea behind the choice of g is the fact that, if a;^, . . . ,x*pf G M 
denote the zeros of ttJ^' and their empirical measure, then we can write 

<'(:e) =[](a;-<) = e'^i«'°«("-^^^"(''^^ (2.65) 

compare also to (|1.4|l . Since the asymptotic statistics of the zeros and of the 
ensemble particles are very close to each other, we should have tt^' « e^^, and 
e^9 will indeed turn out to be the main term in the expansion. 
Consider the transformed jump matrix 

/ Nlg^(x}~g+(x)] Nlg^ix)+g+{x)-Q(x)+l] \ 
^'^'(") - ( ,mg,ix,-gAx)] j, -eK. (2.66) 

^Note that I2.63i is not a standard RH problem, compare (iii). 

*This step is analogous to the exponential change of measure in large deviation theory, 
which absorbs the main exponential factor in the probability with respect to a transformed 
measure under which the deviation behavior becomes typical. However, because of the great 
amount of zeros of f"^', the exponential term is rather subtle here. 



is analytic m C \ M, 
rl"'(a;)w(a;). 



xeR, 



as z —> 00. 



(2.63) 
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Then the unique solution, m'^', of the RH problem (R, u'^') can easily be calcu- 
lated from y'"' in ProDOsition l2.19l its (1, l)-entry is Trj^'e"^^. This means that 
the leading (exponential) term has been isolated in the transformed RH prob- 
lem (R, w'"^'). It turns out that, outside the support of the equilibrium measure, 
v^^\x) is exponentially close to the identity matrix, and inside this support we 



have 



-27ri N fj.,{[x,oo)} 







1 

„27ri N fj.,{[x,oo)) 



1 



„2Nv-{x} 





-1 



1 



(2.67) 



where (p{z) — 27ri t(j{t) dt for z e C \ [—a, a], where we chose a > such that 
supp(/i*) = [—a, a]. We know from Remark I2.5f ii') that t i-^ ip{t){a'^ — t'^)~i 
is analytic in C, and hence ■i/;(t) dt depends on the integration curve from a 
to z: any two curves lead to a difference by an integer multiple of 27ri . Hence, 
z 1-^ e""^*^^^ is well-defined and analytic in C \ [~a, a] and therefore this is true 
for its boundary functions on (—a, a), and ip_. 

The next main step is a deformation of (R, u'^'), which isolates the second 
term in the expansion of tt]^', which is of fourth-root order and hence much 
more subtle. Indeed, the decomposition in the second line of (|2.t)7|) gives rise to 
a deformation into a RH problem (E, f *^'), where S is the union of the real line 
and two curves from —a to a in the upper and lower half plane, respectively, and 
v'^^ is some suitable jump matrix on S. It is relatively easy to prove that, in L^- 



sense, as TV ^ oo, we have v^' 



on S with TJoo = ( 



o) 



Hence, 



the unique solution, m*^\ of the problem (E,w*^') should converge towards the 
unique solution, moo, of the RH problem ([—a, a], Woo)- This is true, but relatively 
hard to prove, in particular on supp(/i*) and here in particular close to the 
boundaries ±a. It is easy to compute that 



Too 



1 

2i 



i(/3 4 



-/3 



i(/3 



where /3(z) = 



+ a/ 

(2.68) 

Computing m*^', re-substituting m<^' and TOoo, and considering the (1, l)-entry, 
we obtain therefore the asymptotics of tt]^' outside the critical points ±a: 



(^) 



(f^f)' + (Sf)'+°(l) 

if z G C \ supp(^>^), 
1 

(iV7r^*([z,a]) 4 



z-\-a 



cos 



cosf A^7r^*([z, a] 



0(1) 



TV J" \og\z-x\ fj.t{dx) 



if z e supp(/i*)°. 

(2.69) 

This explains how to derive the Plancherel-Rotach asymptotics for the orthog- 
onal polynomials for the weight function w{x) — e~^^ . Note that the error 
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terms in H2.69|l are locally uniform outside neighborhoods of ±a. Exploiting the 
Christoffcl-Darboux formula in (|2.45|) . one obtains the statement of Proposi- 
tion [TTH 

In order to obtain the asymptotics of Proposition 12. 151 i.e., the asymptotics 
of 7r^'(z) for z close to ±a, one uses an appropriate deformation into a suitable 
RH problems involving the Airy function, see |De98l Sect. 7.6], e.g. 

2.11 Random matrices and the Riemann zeta function 

Excitingly, it turned out in the early seventies that the spacings of the zeros 
of the Riemann zeta function show a close relation to those of the eigenvalues 
of certain random matrices. The famous Riemann zeta function is defined on 
{3?(s) > 1} as 

OO 

C(s) = ^n-^= n {1-p-T'- (2.70) 

n— 1 p prime 

Riemann showed in 1859 that ( can be meromorphically continued to the whole 
complex plane, and that the functional equation r(s/2)C(s)^/7r = 7r*r(i(l — 
s))C(l— s) holds. This continuation has simple zeros at the negative even integers 
and a simple pole at 1, which is the only singularity. Furthermore, there are 
infinitely many zeros in the so-called critical strip {0 < 3fi(s) < 1}, and none 
of them is real. These zeros are called the non-trivial zeros; they are located 
symmetrically around the real axis and around the line {5R(s) — ■t^}, the critical 
line. Denote them by p„ = /3„ -I- i 7„ with 7-i < < 71 < 72 < • • ■ ■ The famous 
Riemann Hypothesis conjectures that Pn = \ for every n, i.e., every non-trivial 
zero lies on the critical line {5i(s) = 5}. This is one of the most famous open 
problems in mathematics and has far reaching connections to other branches of 
mathematics. Many rigorous results in analytic number theory are conditional 
on the truth of the Riemann Hypothesis. There is extensive evidence for it being 
true, as many partial rigorous results and computer simulations have shown. See 
|Ed74] and ^86] for much more on the Riemann zeta function. 

It is known that the number tt^x) of prime numbers < x behaves asymptot- 
ically as 7r(a;) = Li(a;) -I- 0{x^ \ogx) as x — > 00, where Li(a;) is the principal 
value of /p du, which is asymptotic to and 8 = sup„gpj/3„. Hence, 

the Riemann Hypothesis is equivalent to a precise asymptotic statement about 
the prime number distribution. 

More interestingly from the viewpoint of orthogonal polynomial ensembles, 
the Riemann Hypothesis has also much to do with the vertical distribution 
of the Riemann zeros. Let N(T) be the number of zeros in the critical strip 
up to height T, counted according to multiplicity. It is known that N{T) = 
^ log jTj+Cllog'T") as T 00. In the pioneering work |Mo73| . vertical spacings 
of the Riemann zeros are considered. Denote by 

Rria, b) = ^#{ "i) e : 7„, 7™ <T,a< ^^^i^ J =^ ^} ' « < ^ 

(2.71) 

the number of pairs of rescaled critical Riemann zeros whose difference lies be- 
tween a and b. Then it was proved in |Mo73| . assuming the Riemann Hypothesis, 
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that 



lim i?T(a,5) 





1 - 


/sin(7ru) \ 2- 


/ a 




V TTU J . 



du 



1 ifOe(a,6), 
otherwise. 



(2.72) 



The last term accounts for the pairs m = n. Note the close similarity to (j2.50(l . 
Calculating millions of zeros, |Od87j confirms this asymptotics with an extraor- 
dinary accuracy. 

The Lindeldf Hypothesis is the conjecture that t) — 0{t^) as i — > cx3 for 

any e > 0. The (2/c)-th moment of the modulus of the Riemann zeta function, 



h{T) 



1 

T 



|C(i+it)|"' dt. 



(2.73) 



was originally studied in an attempt to prove the Lindelof Hypothesis, which 
is equivalent to Ik(T) = 0{t'^) as T cx3 for any e > and any k. The latter 
statement has been proved for k — I and k — 2. Based on random matrix 
calculations. 



conjectured that 
where G is the Barnes G-function, and 

P 



k e {5R(s) > -i}, 



a{k) 



me No 



r(m + fc)^^2 
m!r(fc) 



(2.74) 



(2.75) 



This so-called Keating- Snaith Conjecture was derived by an asymptotic calcula- 
tion for the Fourier transform of the logarithm of the characteristic polynomial 
of a random matrix from the Circular Unitary Ensemble introduced in Sec- 
tion 12.41 This conjecture is one of the rare (non-rigorous, however) progresses 
in recent decades in the understanding of the Riemann zeros. 



3. Random growth processes 



In this section we consider certain classes of random growth processes which 
turned out in the late 1990es to be closely connected to certain orthogonal poly- 
nomial ensembles. There is a number of physically motivated random growth 
processes which model growing surfaces under influences of randomly occur- 
ring events (like nucleation events) that locally increase a substrate, but have 
far-reaching correlations on a long-time run. In one space dimension, for these 
kinds of growth processes, limiting phenomena are conjectured that have morally 
some features of random matrices in common, like the fluctuation behavior of 
power-order 1/3 (instead of the order 1/2 in the central limit theorem and re- 
lated phenomena) and the universality of certain rescaled quantities. Recently 
some of these models could be analysed rigorously, after exciting discoveries of 
surprising relations to orthogonal polynomial ensembles had been made. 
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Random growth models may be defined in any dimension, and two and three 
dimensional models are of high interest. However, the high-dimensional cases 
seem mathematically intractable yet, such that we restrict to one-dimensional^ 
models in this text. General physics references on growing surfaces are the mono- 
graphs jBS95j and |Me98j : see also jKS92j . Much background is also provided 
in |P03j and |Fe04bj . Recent surveys on some growth models that have been 
solved in recent years by methods analogous to those used in random matrix 
theory are |Jo01c| and |Ba03j . 

After a short description of one basic model that cannot be handled rigor- 
ously yet in Section im we shall treat basically only two models: the corner- 
growth model introduced in Section 1301 and the PNG model introduced in Sec- 
tion 13.61 The main results on these two models are presented in Sections 13.31 
and Section 13.41 respectively in Sections 13.61 and 13.71 The famous and much- 
studied problem of the longest increasing subsequence of a random permutation 
is touched in Section ITsl since it is instrumental for the PNG model (and also 
important on its own). Furthermore, in Section 13.81 we mention the Plancherel 
measure as an technically important toy model that links combinatorics and 
orthogonal polynomials. 

3.1 The Eden- Richardson model 

A fundamental model for random growth is the so-called Eden- Richardson 
model, which is defined as follows. The model is a random process {A{t))t>o 
of subsets of 7? such that A{t) C A{s) for any t < s. At time t = 0, the set Aq is 
equal to {0}, the origin in Z^. We call a site (i, j) ^ 7?\ A{t) active at time t if 
some neighbor of {i,j) belongs to A{t). As soon as {i,j) is active, a random wait- 
ing time w{i,j) starts running, and after this time has elapsed, {i,j) is added 
to the set process as well. The waiting times w{i,j), {i,j) € Z^, are assumed 
to be independent and identically distributed (0, oo)-valued random variables. 
They can be discrete or continuous. In the case of N- valued waiting times, we 
consider the discrete-time process (A(t))tgNo instead of {A{t))t>o- If and only 
if the distribution of the waiting times is exponential, respectively geometric, 
the process {A{t))t>o, respectively {A{t))teNo^ enjoys the Markov property: in 
the discrete-time case, at each time unit any active site chooses independently 
with a fixed probability if it immediately belongs to the set process or not. In 
this special case, the model is called the Eden-Richardson model. The Markov 
property is not present for any other distribution. 

Actually, the Eden-Richardson model is equivalent to what probabilists call 
last-passage percolation, which we will explain more closely in R,emark l3.1l below. 

The natural question is about the asymptotic behavior of the set A{t) for large 
t. It is not so difficult to conjecture that there should be a law of large numbers 
be valid, i.e., there should be a deterministic set ^ C such that jA{t) ^ A as 
t —^ oo. A proof of this fact can be derived using the subadditive ergodic theorem 
jKe86| ■ which considers the Markovian case. However, an identification of the 
limiting set A and closer descriptions of A for general waiting time distributions 

^Taking into acount the time-evolution, they are sometimes also called (1 + l)-dimensional. 
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seem out of reach. In physics Hterature, it is conjectured that the fluctuations 

be of order t^^^. It is rather hard to analyze Eden's model mathematically 
rigorously. Reasons for that are that A{t) may and does have holes and that the 
growth proceeds in any direction. No technique has yet been found to attack the 
asymptotics of the fluctuations rigorously. This is why we do not spend time on 
the Eden model, but immediately turn to some simpler variant which has been 
successfully treated. 



3.2 The corner-growth model 

An important simpler variant of Eden's model is known as the corner growth 
model. This is a growth model on Nq instead of Z^, and growth is possible only 
in corners. At time zero, A{0) is the union of the x-axis Nq x {0} and the y- 
axis {0} X Nq. Points in \ A{t) are called active at time t if their left and 
their lower neighbors both belong to A{t). As soon as a point is active, 

its individual waiting time w{i,j) starts running, and after it elapses is 
added to the set. This defines a random process {A{t))t>o of growing subsets 
of Nq. Again, if the waiting times are N- valued, we consider {A{t))temo^ ^'^'^ 
the Markov property is present only for the two above mentioned waiting time 
distributions: the exponential, respectively the geometric, distributions. 

It is convenient to identify every point with the square — 5, * + ^) x — 
|, j + i) and to regard A{t) as a subset of [i, 00)^. The process {A{t))t>o consists 
of an infinite number of growing columns, of which almost all are of zero height 
and which are ordered non- increasingly in height. One can view these columns 
as a vector of runners who proceed like independent random walkers, making 
a unit step after an individual independent waiting time, subject to the rule 
that the (i + l)-st runner is stopped by the i-th runner as long as they are on 
the same level. Note that this is a suppression mechanism, not a conditioning 
mechanism. A realization of A{t) is as follows (the active sites are marked by 
'x'). 



Denote by G{M, N) the first time t at which the point (M, N) belongs to 
A{t). Obviously, G{M,N) depends on the variables w{i,j) with i < M and 
j < N only, and the recurrence relation 



G{M,N) =w{M,N) + max{G{M-l,N),G{M,N-l)}, M,N gN, (3.1) 
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is satisfied. From this relation, it is straightforward to derive the formula 

G{M,N)^ max V w{ij), M,NeN, (3.2) 

iren(M,A') ^-^ 

where n(M, N) is the set of nearest-neigbor paths with M + N—2 steps, starting 
from (1, 1), ending at (M, N) and having only steps (0, 1) or (1, 0). An example 
of an array of waiting times with M = 4 and = 6 is 
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We ordered the rows from the bottom to the top. Four maximal paths from 
the left lower corner to the right upper corner are depicted. 

Much of the interest in the corner-growth model stems from the fact that 
it has a couple of connections to other well-known models and admits several 
alternate descriptions: 

Remark 3.1 (Directed last-passage percolation). Switch to the dual lat- 
tice of (where the role of sites and bonds are interchanged) and interpret 
'w{i,j) as the travel time along the bond («, j). Then -^^^ w{i,j) is the travel 
time along the path tt € U{M, N) from (0, 0) to (M, TV). Hence, G{M, N) is the 
maximal travel time from (0, 0) to (M, N) along right/up paths, and this model 
is called directed last-passage percolation. The term 'directed' refers to the use 
of right/up paths. If all nearest- neighbor paths are admissable, then the model is 
called last-passage percolation; this is equivalent to the Eden-Richardson model 
of Section ITTl 

Switching the signs of w{i,j) and ignoring that —w{i,j) is negative, we see 
that —G{M,N) is the minimal travel time (now with passage 'times' —w{i,j)) 
from (0, 0) to (M, iV), which is the well-known model of first-passage percolation. 
An interpretation is as follows. If at the origin there is the source of a fluid, whose 
floating time along the bond {i,j) is —w{i,j), then the set A{t) = {{M, N) : — 
G{M, N) < t} is the set of bonds that are wet by time t. O 

Remark 3.2 (Totally asymmetric exclusion process). The boundary of 
the set A{t) C [^,00)^ is a curve that begins with infinitely many vertical line 
segments of unit length, proceeds with finitely many horizontal and vertical line 
segments of unit length, and ends with infinitely many horizontal line segments 
of unit length. If a square is added to A{t), then a vertical/horizontal pair of lines 
is changed into a horizontal/vertical pair. If we replace vertical lines by a '1' and 
horizontal lines by a '0' and determine the index that refers to the main diagonal 
of as 0, then we can think of the corner growth model as of a particle process 
(xfe(t))fegz G {0, 1}^ where Xk{t) = 1 means that one particle is present at site k 
at time t. In the case of geometric waiting time distribution, the dynamics of this 
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process is as follows. At each discrete time unit, every particle independently 
moves to the right neighboring site with a fixed probability, provided this site 
is vacant. Otherwise, it does not move. These are the dynamics of the so-called 
totally asymmetric exclusion process in discrete time. The event {G{M, N) = t} 
is the event that the particle that was initially at site 1 — has moved M 
steps by time t. There is an analogous representation in continuous time for the 
exponential waiting time distribution. O 

Remark 3.3 (Directed polymers in random environment). Let (S'„)„gNo 
be a simple random walk on Z, then the process {n, Sn)neNa interpreted as 
a directed polymer in Z^. Let j))igNo jsz be an i.i.d. field of real random 
variables. Every monomer (n, 5„) receives the weight l3v{n, Sn), where > is 
interpreted as the inverse of the temperature. This induces a probability measure 
on A'-step paths given by 



In the zero-temperature limit /3 oo, the measure Qn.p is concentrated on 
those paths {So, . . . , Sn) which minimize X^^^Lo ^("■' '^")- This is the analog of 
the corner-growth model with switched signs of the random variables; com- 
pare to (|3.2|) . It is believed that the directed polymer at positive, sufficiently 
small, temperature essentially exhibits the same large-A behavior as the zero- 
temperature limit, but this conjecture is largely unproven. An account on the 
recent research on directed polymers in random environment is in 

Remark 3.4 (Tandem queues). At time zero, there is an infinite number of 
customers in the first queue, and there is an infinite number of other queues, 
which are initially empty and have to be passed by every customer one after 
another. The first customer in any queue (if present) is served after a random 
waiting time, which has the distribution of the waiting times in the corner 
growth model, and he or she proceeds to the end of the next queue. Then, 
at every time t, the height of the i-th column of the set A{t) is equal to the 
number of customers which have passed or reached the i-th queue. A general 
and systematic discussion of the relation between tandem queues and orthogonal 
polynomial ensembles appears in |UC03| . O 

A systematic study of the random variable on the right side of H3.2|l and its 
asymptotics towards Brownian analogs is in EiQl]; see also EtWoH, Effl, 
[BJ02 and Do03 . In fact, for N fixed and under appropriate moment condi- 
tions, in the limit A/ — s- oo, this random variable (after proper centering and 
rescaling) converges in distribution towards 




L{N) 



max 

l>ti>--->t]v-i>0 . 



I^l(l) - Wl{ti) + W2{tl) - W2{t2) ± . . . 



(3.3) 



±WN{tN^l)-WNiO) , 
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where Wi , . . . , Wn are N independent standard Brownian motions on M start- 
ing at the origin. Using Donsker's invariance principle, this may be explained 
as follows. Assume that E[w(l,l)] = and E[?ii(l, 1)^] = 1. The first upstep 
of a path TT in (|3.2() may be expected in the (tAr_iM)-th step, the second in 
the (<Ar_2Af)-th step and so on, where we later optimize on 1 > ii > • • • > 
In-i > 0. The partial sums of w{i,ti-iM), . . . ,w{i,tiM — 1) approach the 
distribution of ^/M[Wi{ti-i) — Wi{ti)] for i = 1, . . . , iV. Hence, we have that 
A/-i/2g'(M,7V) => L{N) as M ^ cx), in the case of centered and normalized 
random variables w{i,j). 

A rather beautiful fact [Eani] . [C^TWOlj is that L{N) is in distribution equal 
to the largest eigenvalue of a GUE matrix, A^'. (For generalisations of this fact 
to Brownian motion in the fundamental chamber associated with a finite Coxeter 
group see BBO05 .) Recall from Thcorcm l2 . 1 71 that we may approximate A^' « 
\/2N+{y/2Ni)^^F2 for N large. Combining the limits for M oo and N oo, 
one is lead to the appealing conjecture (still assuming that E[?i;(l, 1)] = and 
EK1,1)2] = 1) 



G{M, N) 



M,7V^oo. (3.4) 



This assertion has indeed been proven independently in jBMOSj and |BS05| . 
under the additional assumption that M — o{N°') for a < j^. The main tool is 
a classical strong approximation of random walks by Brownian motion, which 
works so well that M can diverge together with N at some speed. However, 
the most interesting case is where M and TV are of the same order, and this 
case is open yet in general. For the two special cases of the geometric and the 
exponential distribution, (|3.4|l has been proven for AI w const, x N. Our next 
two sections are devoted to a description of this result. 

3.3 Johansson's identification of the distribution 

In his beautiful work (JoOOa) . Kurt Johansson deeply investigated the corner- 
growth model for both particular waiting-time distributions, the geometric and 
the exponential distribution. He identified the distribution of G{M, N) in terms 
of the distribution of the largest particle of the Laguerre ensemble (see (|2.6f) ) in 
the exponential case, and of the Meixner ensemble (its discrete analog) in the 
geometric case. 

Proposition 3.5 (Distribution of G(M,iV), [loOOa ). Let G{M,N) be de- 
fined as in (13.21) . and let the w{i,j) be i.i.d. geometrically distributed with pa- 
rameter q £ (0,1), i.e., w{i,j) — k eN with probability {l — q)q''. Then, for any 
M,N <eN with M >N, and for any t £ N, 

FiGiM,N)<t)^-^ Y: A^(x)^n )H (3-5) 

Zm,n ^ , I'-V ^» J ^ 

Remark 3.6. (i) The right hand side of (|3.5|) is the probability that the 
largest particle in the Meixner ensemble on with parameters q and 
M - N is smaller than t + N. 
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(ii) There is an extension of ProDOsition l3 . 51 to the case where the parameter of 
the geometric distribution of w{i, j) is of the form aibj for certain numbers 
ai,bj e (0, 1), see |Jo01cl Sect. 2]. 

(iii) An analogous formula holds for the case of exponentially distributed wait- 
ing times, and the corresponding ensemble is the Laguerre ensemble (Gam- 
ma-distribution in place of the negative Binomial distribution), see H2.6(l . 
This formula is derived in |JoOOa] using an elementary limiting procedure 
which produces the exponential distribution from the geometric one. It is 
remarkable that no direct proof is known yet. Distributions other than the 
exponential or geometric one have not yet been successfully treated. O 

Sketch of the proof of Proposition f3.5l The proof in |JoOOal relies on various 
combinatoric tools, which have been useful in various parts of mathematics for 
decades. A general reference is |Sa91| . 

A generalized permutation is an array of two rows with integer entries such 
that the row of the pairs is non-decreasingly ordered in lexicographical sense. 
An example is 



111112222344 
113331113323 



(3.6) 



the entries of the first and second line are taken from {1,2,3,4} and {1,2,3}, 
respectively. A longest increasing subsequence of the second row has the length 
8; it consists of all the '1' and the last three '3'. Also the first two ones and all 
the threes form a longest increasing subsequence. 

Lemma 3.7 (Matrices and generalized permutations). For any M, N,k G 

N, the following procedure defines a one-to-one map between the set of {M x A^)- 
matrices {W{i, j))i<M,j<N with positive integer entries and total sum 
^i<M i<N ^{hj) equal to k, and the set of generalized permutations of length 
k whose first row has entries in {I, ... , AI} and whose second row has entries in 
{1, . . . , N}: Repeat every pair {i,j) G {1, . . . , M} x {1, . . . , N} precisely W{i,j) 
times, and list all pairs in lexicographical order. By this procedure, the quan- 
tity max^gn(M,A') j)e7r '■^ capped onto the length of the longest non- 
decreasing subsequence of the second row. 



As an example for M = 4, A^ = 3, the matrix 

(3.7) 



W 



/ 1 1 \ 
1 
3 1 
V 2 3 / 

is mapped onto the generalized permutation a in H3.6|l . (In order to appeal to the 
orientation of the corner growth model, we ordered the rows of W from the bot- 
tom to the top, contrary to the order one is used to from linear algebra.) The two 
paths linking the coordinates (1, 1), (2, 1), (2, 3), (4, 3) and (1, 1), (1, 3), (4, 3), re- 
spectively, are maximal paths in H3.2|l : they correspond to the longest increasing 
subsequences mentioned below H3.6|l. 
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Remark 3.8. (i) For the application of Lemma 13.71 for W{i,j) — 'w{i,j) 
geometrically distributed random variables, it is of crucial importance that 
this distribution induces a uniform distribution on the set of {M x N)- 
matrices with fixed sum of the entries, 
(ii) Obviously, Lemma l3 . 71 works a priori only for integer-valued matrices. O 

The next step is a famous bijection between generalized permutations and 
Young tableaux. A semi-standard Young tableau^'^ is a finite array of rows, 
nonincreasing in lengths, having integer entries which are nondecreasing along 
the rows and strictly increasing along the columns. The shape of the tableau, 
A = (Ai)i, is the vector of lengths of the rows. In particular, Ai is the length 
of the longest row of the tableau, and Ai is the total number of entries. An 
example of a semi-standard Young tableau with shape A = (10,8,8,3,1) and 
entries in {1, . . . , 6} is as follows. 
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Lemma 3.9 (Robinson-Schensted-Knuth (RSK) correspondence, 
[KTOj ) . For any M, N,k G N, there is a bijection between the set of gener- 
alized permutations of length k whose first row has entries in {1, . . . , M} and 
whose second row has entries in {1, . . . , N}, and the set of pairs of semi-standard 
Young tableaux of the same shape with total number of entries equal to k, such 
that the entries of the first Young tableau are taken from {!,..., M} and the 
ones of the second from {!,..., N} . This bijection maps the length of the longest 
non- decreasing subsequence of the second row of the permutation onto the length 
of the first row of the tableau, Ai. 

The algorithm was introduced in |Sc61| for permutations (it is a variant of 
the well-known patience sorting algorithm) and was extended to generalized 
permutations in |K70| . 

Sofar, the distribution of G{AI, N) has been reformulated in terms of the 
length of the first row of pairs of semi-standard Young tableaux. The next and 
final tool is a combinatorial formula for the number of Young tableaux. 

Lemma 3.10 (Number of semi-standard Young tableaux). The number 
of semi-standard Young tableaux of shape A and elements in {1, . . . , N} is equal 
to 

-i-r Xj - \,j + j - I 

j — i 

l<i<j<N •' 



For the notions of (standard) Young tableaux and Young diagrams, see Section l3.8l below. 
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The reader easily recognizes that the combinatorial formula in Lemma |3. 101 
is the kernel of the formula in 13. 5|) . Putting together the tools listed sofar, one 
easily arrives at H3.5fl . □ 

Remark 3.11. An alternate characterization and derivation of the distribution 
of G{M, N) is given in |Jo02al Sect. 2.4] in terms of the Krawtchouk ensemble, 

KrM,n,,ix) = Am(x)2 n f r 9"" . ^ G {0, . ■ • , n}^'nWM. 

(3.8) 

There a family of random non-colliding one-dimensional nearest-neighbor proces- 
ses is analyzed, which is a discrete analog of the multilayer PNG-droplet model 
in Section 1X71 below. The joint distribution of this cascade of processes is iden- 
tified in terms of the the Krawtchouk ensemble, and the marginal distribution 
of the rightmost process is identified in terms of G{M, N). This implies that 

P(G(M,7V)<0- J2 KrM.t+Ar+A/-i,,(x), (3.9) 

x<£{0,...,t+M-l}'^' 

i.e., G{M, N) is characterized in terms of the largest particle of the Krawtchouk 
ensemble. O 



3.4 Asymptotics for the Markovian corner-growth model 

Having arrived at the description in H3.5|l , the machinery of statistical mechanics 
and orthogonal polynomials can be applied. The outcome is the following. 

Theorem 3.12 (Asymptotics for the corner-growth model, fJoOOa|). 

Consider the model of Provosition fX3l Then, for any 7 > 1, 

{^) hm lE[G(L7iVj,7V)] = ^i±^^ _ 1 ^ /(^,g), (3.IO) 

where F2 is the distribution function of the GUE Tracy-Widom distribution 
introduced in H2.55|l . and <j{^,q) is some explicit function. 

Remark 3.13. (i) In Theorem 13.121 the weak law of large numbers 
limf^oo = ^ is contained with 

A = {{x, y) e [0, oof:y + 2^9^ +x<\-q}. 



A qualitative picture of A is as follows. 
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(ii) The analogous result for exponentially distributed waiting times is also 
contained in 'JoOOa'. 

(iii) Theorem imT ii) is equivalent to (|^ with M [77VJ (recall that the 
w{i,j) are centered and normalized there, but not in Theorem 13. 121) . 

(iv) Theorem 13.121 is the first and yet only rigorous result of behavior of the 
type 'fluctuations « mean 3 ' for random growth models of the Eden type. 

(v) In IJoOOaj also some large-deviation estimates are proven, i.e., the proba- 
bilities of the events {G([7iVj , iV) < iV(/(7,g)-e)}andof{G([77Vj,iV) > 
N{f{'j,q) + e)} are asymptotically estimated. The former decays on the 
scale e'^'^^-', while the latter decays on the scale e'-'*-^-'. 

(vi) The proof of Theorem 13.121 can also be built on the Krawtchouk repre- 
sentation in l|3.9|l by using asymptotics for the Krawtchouk kernel; see 
|.)o()2a| . O 

Sketch of the proof of Theorem 13.121 The structure of this proof is analo- 
gous to the proof of Theorem 12. 171 The right hand side of H3.5|l may be written 
in terms of the Meixner kernel 



M-N+l, (3.12) 



where mj{x) — kjx^ + 0{x' 



— rvjo,- -r \^\^^ ^) are the orthonormal polynomials with respect 
to the discrete weight w'"l'{x) = (^^^^^)'?^, a; e N. (Both the polynomials vrij 
and the kernel K'^^ also depend on L and q.) Indeed, computations similar to 
those of Section [^7| imply that 

right hand side of H3.5() 



N 

E 



Ml! 

fc! 



fc=0 he{t+N,t+N+l,...}'^ 

The Meixner kernel satisfies the scaling limit 

hm aN"^K'^l{{f + l)N + aN"^x,{J + l)N + aN"^y) ^ KM{x,y), 



det 



(3.13) 



(3.14) 



where Xai is the Airy kernel in (|2.53|) . and / = f{j,q) and a = cr{'-f,q) are as 
in the theorem. Now the remainder of the proof is analogous to the proof of 
Theorem Em □ 
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3.5 Longest increasing subsequences of random permutations 

Another problem that has been recognized to be strongly related to random 
growth processes is the problem of the length of the longest increasing subse- 
quence of a random permutation. Let &n denote the set of permutations of 
1, . . . , iV, and let ct be a random variable that is uniformly distributed on Sat, 
i.e., a random permutation. The length of the longest increasing subsequence 
of cr is the maximal k such that there are indices 1 < ii < 12 < ■ ■ ■ < ik ^ N 
satisfying a{ii) < a{i2) < ■ ■ ■ < (T{ik)- We denote this length by £n. In the 
early 1960's, Ulam raised the question about the large- iV behavior of £n. Based 
on computer simulations, he conjectured that c = limAr^oo -^~^^^IE(£jv) exists 
in (0,00). The verification of this statement and the identification of c have 
become known as 'Ulam's problem'. A long list of researchers contributed to 
this problem, including Hammcrsley, Logan and Shepp, Vershik and Kerov, and 
Seppalainen. By the end of the 1990's, it was known that the above limit exists 
with c = 2, and computer simulations suggested that^"'^ 



1- ""Hu. - (3-15) 

A survey on the history of Ulam's problem may be found in |UROO| and |AD99| . 

There is a 'Poissonized' version of Ulam's problem, which is strongly related 
and provides a technical tool for the solution of Ulam's problem. Consider a 
homogeneous Poisson process on (0, 00)'^ with parameter one, and let L{X) be 
the maximal number of points of this process which can be joined together by 
a polygon hue that starts at (0,0), ends at (VA, \/A) and is always going in 
an up/right direction. Then it is easy to see^^ that the distribution of L{X) is 
equal to the distribution of £n*, where N* is a Poisson random variable with 
parameter A. Via Tauberian theorems, asymptotics of the distribution of L{X) 
as A ^ cxD stand in a one-to-one correspondence to the large- A'^ asymptotics of 

There are exact formulas for the distributions both of £n and L{X), which 
have been proved by many authors using various methods (see |BD.T99| ). Indeed, 
for any n G N, we have 



V J Jl—IT,TV\ l<k<J 

exp {2\/A^cos6'J JI 



(2Ny. J, ^^i,A^ JJ 11 ' ' (27r)"n! 

V J J\—iT,TV\ l<k<]<n ^ ' 

ie.i2 d"6i 



e - e 



j=l l<fc<j<n 



(27r)"n! 
(3.16) 

In "BDJ99' , sophisticated and deep methods are applied to the right hand side of 
(EUni, which have previously been established in jDTgS) . |DZ95| and ,DVZ97] : 

"Recall Remark l2?T8T iii1. 

^^The main reason is the characteristic property of the Poisson process that, given that 
there are precisely Poisson points in the square, these points are conditionally independent 
and uniformly distributed. 
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the steepest-decent method for the computation of asymptotics of solutions to 
certain Riemann-Hilbert problems. As a result, a limit law for ^^r is proved, 
which shows again the universality of the Tracy- Widom distribution for GUE 
in (IT55I1: 



Theorem 3.14 (Limit law for £n, [BDJ99 ). Let £n be the length of the 
longest increasing subsequence of a random permutation, which is uniformly 
distributed over &n. Then, as N ^ 00, the scaled random variable 



converges in distribution towards the Tracy-Widom distribution for GUE. More- 
over, all moments of xn ctlso converge towards the moments of this distribution. 
Both assertions are true also for {L(\) — 2\/A)A~6 as A — > 00. 

Sketch of the proof. We sketch some elements of the proof, partially also 
following |P08[ Sect. 3.1]. We consider the Poissonized version and consider 
L(A2) instead of L(A). 

The starting point is an explicit expression for the probability of {i(A^) < 
N} for any TV G N and any A > in terms of the Toeplitz determinant^'^ 
Dn,x = detTAr(e^'^™^(')). More precisely, one has 

P(L(A2) < TV) = e'^' detrjv(e2^™^(-)) = e'^'Djv.A, (3.18) 

a remarkable formula which has first been derived in (Ge90| . based on the RSK- 
correspondence of Lemma 13.91 On [0, 27r] we introduce the inner product 

(P,9>A-/ p{e'%{e^')e''^°^'^. (3.19) 

Jo 

Consider the sequence of orthogonal polynomials (7r^')jvgNo with respect to 
{■,-)\ which is obtained via the Gram-Schmidt algorithm from the monomials 
z", n e No. We normalize tt]^' such that 7r<^'(z) = + 0{z^~^) and define 
y^''' = ||7r<^'||2^ such that we have 

{^n\^n')^ = ^n,n'V;^"\ N,N' eNo. (3.20) 

Classical results on orthogonal polynomials (see |Sz75| for some background) 
imply the identities 

JV-l 



i^jv.A = detT^(e2^ =-(•') =det(((z^^')A)M=o.....JV-l) = [] ^fc' 

k=0 

N-1 k 

=(K,<^')T[n(i-«'(o))')- 



(3.21) 



k=o 1=1 



i3-\Ye recall that the (A'' X N) Toeplitz matrix Tpf{f) = [iif^_i)f^ i—o,....N-i with respect 
to the weight function / on [0, 2tt] is defined by the Fourier coeficients /i^. = J^^ e' f{9) 
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For our special choice of the weight function, g , one obtains a nonhnear 
recursion relation for the sequence (7r^'(0))ArGNo5 which are called the discrete 
Painleve 11 equations. Indeed, the numbers i?^' = (— l)^"'"^7r^'(0) satisfy 



Kn+1- ^R-N + ^N-i 1 _ ' Jyem. [6.22) 



Putting N = [2A + AasJ, niultipling 1)3. 22|) with A and letting A oo, we see 
that the function 

q(s) = - lim A^i?'^' 1 , s G K, (3.23) 

A^oo L2A+A3SJ 

should satisfy the (continuous) Painleve II equation in H2.54|l . The initial value 
i?g^' = —1, i.e., g'(— 2A^) ^ A^, implies that we are dealing with that solution 
of (|2.54(l that is positive in (— cxd,0). Hence, q is identical to the solution q of 
<|2.54|l with q{x) ^ Ai(x) as a; — > oo; recall the text below (|2.54|) . 

Note that implies that Dn+i,xDn-i,x/D% ;^ = 1 - (i?^')^. Using this 

in (|3.18|l we obtain, for A — > oo, 



(^)'logP((L(A2)-2A)A-* < s)^\i(\ogDN+i,x~2logDN,x+\ogDN^i 
= Ailog(l-(i?;^' , )') «-(A^i?<^' ,Y^~q{sf^{logF,ns). 

(3.24) 

Hence, we have finished the identification of the limiting distribution of L(A^). 

The technically hardest works of the proof are the proofs of the convergence 
in (|3.23|l and of the convergence of the moments, which require an adaptation 
of the Deift-Zhou steepest descent method for an associated Riemann-Hilbert 
problem. □ 



3.6 The poly nuclear growth model 

Consider the boundary of a one-dimensional substrate, which is formed by the 
graph of a piecewise constant function with unit steps. At each time t > 0, the 
separation line between the substrate and its complement is given as the graph 
of the function h{-,t): M ^ R. Occasionally, there occur random nuclear events 
in states a;* at times t* , and the process of the pairs {x*,t*) forms a Poisson 
point process in the space-time half plane M x [0, oo) with intensity equal to two. 
Such an event creates an island of height one with zero width, i.e., h has a jump 
of size one in x* at time t* . Every island grows laterally (deterministically) in 
both directions with velocity one, but keeps its height, i.e., for small e > the 
curve h{-, t* + e) has the height h{x* , t*) in the £-neighborhood of x* and stays 
on the same level as before t* outside this neighborhood: 
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h{x,t* +e) 



h{x*,t*) 



h{x*,t*) - 1 



X 



X 



■* -e 



X* +e 



The bullet marks the nucleation event, and the two arrows indicate the lateral 
growth of velocity one in the two directions. We call the graph of h(-, t* + e) in 
the £-neighborhood of x* a growing island. If two growing islands at the same 
level collide, then they merge together and form a common growing island. The 
nucleation events occur only on top of a growing island, and they occur with 
constant density equal to two. 

This is a (rather simple) model for poly nuclear growth (PNG) in 1 + 1 dimen- 
sion. Among various initial conditions that one could impose, we shall consider 
only two: the flat case, where h{x, 0) = for any a; S M, and the droplet case, 
where h{x,0) = — oo for a; 7^ and /i(0,0) = 0. The droplet case may also be 
defined with the initial condition h{-,0) = by requiring that nucleation events 
at time t may happen only in [— i,t]. 

Let us first consider the droplet case. A beautiful observation |PSnO| is the fact 
that the PNG model stands in a one-to-one relation to the Poissonized problem 
of the longest increasing subsequence in a rectangle. Using this correspondence, 
one arrives at the following limit assertion. 

Theorem 3.15 (Limit Ism for the PNG droplet, [PSOOj). Let h{x,t) be 
the height of the PNG droplet at time t over the site x, and let c [—1.1]. Then 



where F2 is the GUE Tracy-Widom distribution function, see (|2.55() . 

Idea of proof. We consider the space-time half plane M x [0, 00). For any space- 
time point (a;, t), we call the quarter plane with lower corner at (x, t) and having 
the two lines through {x,t) with slope 1 and —1 as boundaries the (a;, t)-quarter 
plane. Recall that nucleation events occur in the (0, 0)-quarterplane only, which 
is the region {(a:, t) : \x\ < t}. 

First note that every nucleation event at some space-time point (a;*, t*) influ- 
ences the height of the curve h only within the (a;*, t*)-quarter plane. Second, 
note that any nucleation event (y*,s*) within the (a;* , )-quarter plane con- 
tributes an additional lifting by level one (to the lift created by the nucleation 




(3.25) 
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event {x*,t*)) for any space-time point in the intersection of the two quarter 
planes of the nucleation events, since the growing island created by {y* , s*) will 
be on top of the growing island created by {x*,t*). However, if {y*,s*) occurs 
outside the (x* , t* )-quarter plane, their influences are merged to a lift just by 
one step since their growing islands are merged to one growing island. 

Now fix a space-time point (x, t) in the (0, 0)-quarter plane. In the space-time 
plane, consider the rectangle R having two opposite corners at the origin and at 
the point {x,t) and having sides of slopes 1 and —1 only. Condition on a fixed 
number N of nucleation events {xl,tl), . . . , (x^, t*j^) in the rectangle R. 

time 




space 



Rotate the rectangle by 45 degrees. The preceding observations imply that 
only those nucleation events contribute to the height h{x, t) which can be joined 
together by a polygon line consisting of straight up/right lines, leading from the 
corner of the rectangle R at the origin to the corner at {x,t). The maximal 
number of nucleation events along such path is equal to the height h{x,t). 
Hence, the length of the longest increasing subsequence in a unit square with 
Poisson intensity A = Vi^ — a? has the same distribution as the height h{x,t). 
Using Theorem 13. 141 one concludes the assertion. □ 

In particular, the fluctuation exponent 1/3 is rigorously proved for this growth 
model. Such a result has not yet been achieved for any other growth model of 
this type. However, this fluctuation behavior is conjectured for a large class of 
(1 -|- l)-dimcnsional growth processes, provided the spatial correlations are not 
too weak. 

The flat initial condition, h{-,0) — 0, interestingly leads to the GOE Tracy- 
Widom distribution instead of the GUE one: 

Theorem 3.16 (Limit law for the flat PNG model, [PSOOJ). Let h{x,t) 
be the height of the flat PNG model at time t over the site x. Then, 

lini pfM^^4^ < 2-2/3 ^Fi(s), seM, (3.26) 

t— too V ^3 / 

where Fi is the GOE Tracy- Widom distribution function, see (|2.58|) . 
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The above explanation for the droplet case has to be adapted to the flat case 
by replacing the rectangle with corners at the origin and (x, t) by the triangle 
with base on the axis t = 0, corner at (x, t) and side slopes 1 and —1. See |Fe04a| 
for more detailed results on the flat PNG model. 

For other initial conditions (among which some lead to the GSE Tracy- Widom 
distribution, F4), see jP03 ! Sect. 3]. We recall that a discrete-space version of 
the PNG model is analyzed in JoOj^ Sect. 2.4]; see also Remark l3.11l A recent 
survey on the PNG droplet and its relation to random matrices and further 
random processes, like directed polymers, the longest increasing subsequence 
problem and Young tableaux, appears in |FP05j . 

3.7 The multi-layer PNG droplet and the Airy process 

The PNG droplet has been analysed also as a process. Interestingly, the limiting 
distribution of the height process in the correct scaling bears a close relationship 
to Dyson's Brownian motions (see Theorem 14. If) , which is best seen when ad- 
ditional layers of substrate separation lines are introduced. The so-called multi- 
layer PNG droplet (sometimes also called the discrete PNG model) is defined 
as follows. We write ho instead of h and add an infinite sequence of separation 
lines he{x, t) with i e — N with initial condition he{x, 0) = £. Nucleation events 
only occur to the zeroth line ho, and they occur at time t in the interval [— 
only (i.e., we consider the droplet case). Every merging event in the £-th line 
(i.e., every event of an amalgamation of two neighboring growing islands at the 
same height) creates a nucleation event in the {£ — l)-st line at the same site. 
Apart from this rule, every island on any level grows deterministically with unit 
speed into the two lateral directions as before. 

Hence, randomness is induced only at the zeroth line, and all the other lines 
are deterministic functions of Hq. Observe that the strict ordering hg{x,t) > 
hg-i{x,t) for any x,t,£ is preserved. Hence, the lines form a family of non- 
colliding step functions with unit steps. For any £ g —No and at any time t > 0, 
the £-th line hi{-,t) is constant equal to £ far away from the origin. Only a 
finite (random) number of them have received any influence coming from the 
nucleation events, and only within a finite (random) space-time window. 

An interesting observation JS02a, is that, in the long-time limit, the multi- 
layer PNG droplet process approaches the large- iV limit of Dyson's Brownian 
motions (see Section ITTI below l in the appropriate scaling. More precisely, let 
A'"'(t) = (Ai"'(i), . . . , A^>(i)) e Wn be Dyson's Brownian motion at time t as 
in Theorem l4.1l Then the Airy process may be introduced as the scaled limiting 
distribution of the largest particle, more precisely, 

{V2N^X^^\yN- ^) - Vm))^^^ =^ my))yeu- (3-27) 

Computer simulations show that the space-time region in which the lines are not constant 
asymptotically forms a droplet that approaches a circle. This region stays strictly inside the 
circle, which is due to the negativity of the expectation of the GUE Tracy- Widom distribution, 
recall Remark l2.18f iiiV 

^^To be more precise, in contrast to Section l4.ll here Dyson's Brownian motions arc not 
based on Brownian motions, but on Ornstein-Ulenbeck processes, which are Brownian motions 
with drift to the origin and hence stationary. 
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Convergence has been established in the sense of finite-dimensional distributions 
in |PS02aj and in process sense in |Jo03| . For any y > 0, the random variable 
2ti(y) has the GUE Tracy- Widom distribution F2 in 12.55|l . and the family of 
these random variables forms an interesting stochastic process. The Airy process 
(2ti(j/))j^gR is a stationary, continuous non-Markovian stochastic process which 
may be defined via its finite dimensional distributions, using a determinant 
formula involving the Airy kernel KAi in (|2.52|) PSO^, see also P03, Sect. 5]. 

In |PS02aj it turns out that, in the appropriate scaling, the joint distribution 
of all the lines hi of the multilayer PNG droplet approaches the Airy process. 
We state the consequence of this statement for the first line as follows. 

Theorem 3.17 (Process convergence of the PNG droplet, [PS02al). 

Let h{x,t) he the height of the PNG droplet at time t over the site x. Then, in 
the sense of finite- dimensional distributions, 

h{th,ty2t ^ ^^^^^ ^ ^ 

where {^i{y))y^^ is the Airy process. 

Some progress on the process version of the flat PNG model has been made 
in |Fe04aj . Discrete versions of the PNG model have been analysed in |ISfl4a| . 
pSOlb'. 

Another interesting process that converges (after proper rescaling) in distri- 
bution towards the Airy process is the north polar region of the Aztec diamond 

3.8 The Plancherel measure 

The Plancherel measure is a distribution on the set of Young tableaux which 
exhibits an asymptotic behavior that is remarkably similar to that of the spec- 
trum of Gaussian matrix ensembles. Most interestingly, this measure may be 
studied for any value of the parameter /3, which is restricted to the values 1, 2 
and 4 in the matrix cases. 

A Young diagram, or equivalently a partition A = (Ai, A2, . . . ) of {1, . . . , N} is 
an array of TV boxes, such that Ai of them are in the first row, A2 of them in the 
second and so on. Here A is an integer-valued partition such that Ai > A2 > . . . , 
and — ^ ■ We think of the rows as being arranged on top of each other. 

A standard Young tableau is a Young diagram together with a filling of the 
boxes with the numbers 1, . . . , iV such that the numbers are strictly increasing 
along the rows and along the columns. The vector A is called the shape of the 
tableau. For every A, we denote by dx the number of Young tableaux of shape 
A. For every /3 > 0, we define the Plancherel measure as the distribution on the 
set Y]si of partitions of {1, ... , iV}, which is given by 

P1^'(A)= p . XeY^. (3.29) 

^® Compare to the definition of a semistandard Young tableau prior to Lemma |3.9l where 
more numbers may appear, and their order is just nondecreasing. 
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We can conceive A^." , the length of the fc-th row, as an No-valued random 
variable under the probability measure Pl^' on Yjy. 

The case f3 = 2 has been studied a lot. Basically, it was shown that the 
limiting statistics of the sequence X["\ X^^\ . . . , in an appropriate scaling, is the 
same as the one for the eigenvalues of an {N x TV) GUE-matrix. We mention just 
a few important results. As a by-product of their study of the longest increasing 
subsequence of a random permutation, in I BDJ99| the limit theorem 

lim Pl<^' ( ^ < .s ) = F2(s), s e R, (3.30) 

is shown, where F2 is the Tracy- Widom GUE distribution function. The con- 
jecture of jBD,T99j that for every fc S N the scaled limiting distribution of AJ. 
is identical to the one of the fc-th largest eigenvalue of a GUE-matrix was in- 
dependently proved in |BD.TOO| for fc = 2, and for general k in |,To01bj and 
jBUO00| . The convergence of the moments of the scaled row lengths was also 
proved in |BDJ99| . |BDJOO| and |Jo01bj . respectively. The bulk-scaling limit 
was also proved in |BUUOO| . The case /3 = 1 (which is analogous to the GOE 
case instead the GUE case) has been studied in IBE01| . 

4. Non-colliding random processes 

In this section wc systematically discuss conditional multi-dimensional random 
processes given that the components never collide with each other. These pro- 
cesses are sometimes called vicious walkers, non- colliding processes or noninter- 
secting paths in the literature. The earliest hint at a close connection between 
non-colliding random processes and orthogonal polynomial ensembles was found 
in |Dy62b| , where a natural process version of the Gaussian Unitary Ensemble 
was considered. It turned out there that the mutual repellence in (12. 3|) receives 
a natural interpretation in terms of Brownian motions conditioned on never col- 
liding with each other. This theme apparently was not taken up in the literature 
up to the beginning of the nineties, when people working in stochastic analysis 
turned to this subject. Since the discovery of close connections also with ran- 
dom growth models at the end of the nineties, non-colliding processes became 
an active research area. 

4.1 Dyson's Brownian motions 

A glance at the Hermite ensemble in (12. 3|) shows that there is a mutually re- 
pelling force between the eigenvalues: the density vanishes if any two of the N 
arguments approach each other. It does not seem easy to derive an intuitive 
reason for this repellence from random matrix considerations, but if the ma- 
trix M is embedded in a natural process of random Hermitian matrices, then 
the process of eigenvalues admits a nice identification that makes the repellence 
natural. 

Theorem 4.1 (Dyson's Brownian motions, |Dy62b| ). For any i € 
{!,..., TV} resp. i < j, let (M,,,(t))t>o and iM^y{t))t>o and (M^;j(<))t>o be 
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independent real standard Brownian motions, starting at zero, such that the 
Hermitian random matrix M[t) = {Mij{t))ij^i^,,,^]\f with Mi_j{t) — M^^'{t) + 
i M^^j{t) has the distribution of GUE at time t = 1. Then the process {X{t))t>o 
of eigenvalue vectors \{t) of M{t) is a conditional Brownian motion on M.^ , 
starting at zero, given that the particles never collide with each other, i.e., con- 
ditional on the event {\i{t) < \2{t) < ■ ■ ■ < \N{t) for all i > 0}. 

This theorem has to be explained in some detail. 

Remark 4.2. (i) It is remarkable that, in particular, the process of eigen- 
value vectors is Markov. This is not true for, say, the process of the largest 
eigenvalues, {\N{t))t>o- 

(ii) The original proof in |Dy62b| makes nowadays an old-fashioned impres- 
sion. See |Br91| for a modern stochastic analysis treatment of an analogous 
matrix-valued process for Wishart-matrices in the real-valued setting. In 
this setting, the process of eigenvalues also turns out to be Markov, but 
does not admit a conditional interpretation. The latter is also true in the 
analogous GOE setting. 

(iii) The event of never coUiding, {\i{t) < X2{t) < ■ ■ ■ < Ajv(i) for all t > 0}, 
has zero probability for N independent Brownian motions. Hence, the 
definition of the conditioned process needs some care. First observe that 
the non-colliding event is the event {X{t) S Wn for all t > 0}, where 
Wn = {x G : a:i < X2 < • • • < xn} is the Weyl chamber. Probabilists 
like to write this event as {T — oo}, where T — inf{t > 0: X{t) e W^} 
is the exit time from Wn, the first time of a collision of any two of the 
particles. One way to construct the conditional process is to condition 
on the event {T > t} and prove that there is a limiting process as t 
oo. Another one is to consider the Doob-/i transform of the vector of 
N independent standard Brownian motions with some suitable function 
h : Wn (0, cxd) that vanishes on the boundary of Wn and is harmonic for 
the generator of the A'^-dimensional Brownian motion in Wn ■ Remarkably, 
it turns out that h = A^r, the Vandermondc determinant, satisfies all these 
properties, and that the /i-transform with this function h is identical with 
the outcome of the first construction. See Section IT!^ below for a general 
treatment of this issue. 

(iv) The Markov process (A(t))f>o has the invariant measure x t-^ An{x)'^ dx, 
which cannot be normalized. 

(v) Also in the real and the symplectic version, the eigenvalue process, (A(t))t>o, 
turns out to be a diffusion. An elementary application of Ito's formula 
shows that (A(t))f>o satisfies the stochastic differential equation (see jBr91| 
for related formulas) 

^ 1 

dA, =dS, +/3V- -dt, z = l,...,7V, (4.1) 

j=i Xi - Xj 

where Bi, . . . , Bn are independent Brownian motions, and /3 G {1, 2, 4} is 
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the parameter as in H2.5|l . The generator of the process (A(t))f>o is 

N N N 



(4.2) 



i=l i=i 



The generators in the GOE and the GSE setting have a factor different 
from 2 before the drift term. Apparently this circumstance makes it im- 
possible to conceive the processes as Doob transforms of N independent 
processes. O 



4.2 Harmonicity of the Vandermonde determinant 

Now we consider more general multi-dimensional random processes and their 
conditional version given that no collision of the particles occurs. As was pointed 
out in Remark l4.2r iii). the construction needs some care, since the conditioning 
is on a set of probability zero. It turns out that the rigorous definition may be 
given for many processes in terms of a Doob /i-transform with = Aat , the Van- 
dermonde determinant in . Even more striking, the marginal distribution of 
the transformed process turns out, for a couple of explicit examples, to be given 
by well-known orthogonal polynomial ensembles, like the Hermite ensemble in 
1)2. 3(1 for the case of conditional Brownian motions. 



4-2.1. The continuous case. 



Let us turn first to the time-continuous case with continuous paths, more pre- 
cisely, to diffusions. We fix G N and an interval / and let X — {X{t))t>o be 
a stochastic process on . Assume that Xi, . . . , X^ are A^ independent and 
identically distributed diffusions Xi = {Xi{t))t>a on /. Under the measure P^, 
they start at Xi{0) — Xi £ I, where x = (xi, . . . , xjv). By pt{x, y) we denote the 
transition density function of any of the diffusions Xi, i.e., 

N 

F,{X{t)edy) = l[[pt{x,,y,)dy,], x,yel^. (4.3) 

i=l 

Recall the Weyl chamber and its exit time, 

Wn ^ {x eR'^ : xi < ■■■ < xn} and T ^ M{t > 0: X{t) ^ Wn}. 

(4.4) 

In words: T is the first time of a collision of any two of the A^ components of the 
process. Recall the Vandermonde determinant Ajv(x) = ni<i<j<jv(^j ^ ^«)- 
order to be able to construct a Doob-/i transform of the process with h = Ajy on 
Wn, the basic requirements are: (1) An is positive on Wn, (2) An is harmonic 
with respect to the generator G of the process X, i.e., GAn — 0, and (3) 
Ajv(X(t)) is integrable for any t > 0. 

Clearly, the first prerequisite is satisfied. Furthermore, it turns out that An 
is harmonic for a quite large class of processes: 
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Lemma 4.3 (Harmonicity of A^r, continuous case jKOOlp . We have 
GAn — (i.e., An is harmonic with respect to G) if there are a, 6, c € M such 
that 

N N 

G = ^[{ax^+b)df+cd^'\ or = ^ [(x?+ax,+6)af +(|(iV-2)a;,+c)a, 

i=l i=l 

(4.5) 

The proof consists of an elementary calculation. Lemma 14.31 in particular 
covers the cases of Brownian motion, squared Bessel processes (squared norms 
of Brownian motions) and generalized Ornstein-Uhlenbeck processes driven by 
Brownian motion. For general diffusions, existence and identification of positive 
harmonic functions for the restriction of the generator to the Weyl chamber are 
open. 

As a consequence of Lemma 14.31 we can introduce the Doob /i-transform of 
X with h — An. This is a diffusion on Wn H , which we also denote X. Its 
transition probability function is given by 

hiXit) e dy) = P,(T > t;X{t) e d2/)^44' x,yeWNnl^,t> 0. 

An(x) 

(4.6) 

The transformed process is often called the conditional process X, given that 
there is no collision of the components. In order to justify this name, one must 
show that 

lim P^{X{s) edy\T>t)= P^{X{s) G dy), for any x,y e Wn, s > 0. 

t — i-OO 

(4.7) 

This may be proven in many examples with the help of the Markov property at 
time s and an asymptotic formula for Fz{T > t) as t ^ oo, see Remark l4.10f ii'). 
In Section 14.31 we provide two tools. In Section 14.41 we list a couple of exam- 
ples of Ajv-transformed diffusions, whose marginal distribution is an orthogonal 
polynomial ensemble. 



4-2.2. The discrete case. 



There is also a discrete version of Lemma lOl Recall that a vector w on a discrete 
set / is called a positive regular function for a matrix Q with index set I x I if 
all the components of v are positive and Qv = v holds. 

Lemma 4.4 (Regularity of An, discrete case [KOR02'). Let (X(n))„gN 
be a random walk on such that the step distribution is exchangeable and the 
N-th moment of the steps is finite. 

(i) Then An is harmonic for the walk, i.e., 'E.x[An{X{\))] = An{x) for any 
X G M.^ , and the process Ajv(X(ri))„gNQ is a martingale with respect to 
the natural filtration of {X{n))neN- 
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(a) If {X{n))n takes values in only and no step from Wn to W j\j has posi- 
tive probability, then the restriction of An to WjvnZ"'^ is a positive regular 
function for the restriction Pwn — {pi^iy))x.y<£WNni,'^ of the transition 
matrix P = {p{x,y))^^y(zz,N , i.e., 

pix,y)ANiy) = AN{x), for any x e HWn. (4.8) 

The condition in Lemma I4.4r ii') is a severe restriction. It in particular ap- 
plies to nearest-neighbor walks on with independent components, and to 
the multinomial walk, where at each discrete time unit one randomly chosen 
component makes a unit step, see Section ^31 Further examples comprise birth 
and death processes and the Yule process |Dor)5[ Ch. 6] . 

Under the assumptions of Lemma 14.41 one can again define the /i-transform 
of the Markov chain X by using the transition matrix P = (p(a;, j/))^, ygv^wnz" 
with 

p{x,y)=p{x,y)^^, x,yeWNnZ^. 

Remark 4.5. Arbitrary random walks with i.i.d. components are considered in 
IEK05-I-I . Under the sole assumption of finiteness of sufficiently high moments 
of the steps, it turns out there that the function 

Vix)^AN{x)~E,[ANiX{T))l xeWN. 

where r = inf{n E N: X{t) ^ Wn} is the exit time from Wn, is a positive reg- 
ular function for the restriction of the walk to Wn . (Note that V coincides with 
Ajv in the special cases of Lemma |4.4f iiV) Since the steps are now arbitrarily 
large, the term 'non-colliding' should be replaced by 'ordered'. Furthermore, an 
ordered version of the walk is constructed in terms of a Doob /i-transform with 
h = V, and some asymptotic statements are derived, in particular an invariance 
principle towards Dyson's Brownian motions. O 

4.3 Some tools 

We present two technical tools that prove useful in the determination of prob- 
abilities of non-collision events. 

4-. 3.1. The Karlin-Mc Gregor formula 

An important tool for calculating non-colliding probabilities is the Karlin- 
McGregor formula, which expresses the marginal distribution of the non-colliding 
process in terms of a certain determinant. 

Lemma 4.6 (Karlin-McGregor formula, [KM59 ). Let {X{t))t>o be a dif- 
fusion on that satisfies the strong Markov property. Then, for any x,y G Wn 
and any t > 0, 
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where pt{x,y) is the transition probability function of the diffusion, see 1)4.3(1 . 

Proof. By ©at we denote the set of permutations of l,...,iV, and sign((T) 
denotes the signum of a permutation a. We write j/o- — (?/cr(i)j ■ • • j ya(N))- We 
have 



since ah the summands Vx{T > t,X{t) E dya-)/dy are equal to zero, with the 
exception of the one for the identical permutation. 

At time T, the i-th and the j-th coordinate of the process coincide for some 
i < j, which we may choose minimal. Reflect the path {X{s))s^[T.t] in the 
plane, i.e., map this path onto the path {X\{s))s^[T.t]j where A G (3jv is the 
transposition that interchanges i and j. This map is measure-preserving, and 
the endpoint of the outcome is at yo-oA if ^{t) = y^^. Summing on all i < j (i.e., 
on all transpositions A), substituting <j o \ and noting that its signum is the 
negative signum of cr, we see that the right hand side of (|4.10|l is equal to its 
negative value, i.e., it is equal to zero. The proof is finished. □ 

Remark 4.7. (i) The main properties of the process that make this proof 
possible are the strong Markov property and the continuity of the paths. 
No assumption on spatial dependence of the transition probability function 
is needed. 

(ii) For discrete-time processes on Z there is an analogous variant of Lcmma^^l 
but a kind of continuity assumption has to be imposed: The steps must 
be —1, or 1 only, i.e., it must be a nearest-neigbor walk. This ensures 
that the path steps on the boundary of Wn when leaving Wn , and hence 
the reflection procedure can be applied. O 

4-3.2. The Schur polynomials 

Another useful tool when dealing with certain determinants is the Schur poly- 
nomial, 



F,{T>t,X{t)€dy) 
dy 



det[{pt{x„yj)) 




sign(cr) 




(4.10) 



Schur2(a;) 




N 



(4.11) 



AAr(a;) 



It turns out that Schur^ is a multipolynomial in xi, . . . , xat, and it is homo- 
geneous of degree zi + ■ ■ ■ + zn — ~ coefficients are nonnegative 
integers and may be defined in a combinatorial way. It has the properties 
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Schur2(l,. . . , 1) = An{z)/An{x*) (where we recall that x* = (0, 1, 2, . . . , iV - 
1)), Schura;.(a;) = 1 for any x E K^, and Schur2(0, . . . , 0) = for any z G 
Wn\{x*}. 

A combination of the Karlin-McGregor formula and the Schur polynomials 
identifies the asymptotics of the non-collision probability and the limiting joint 
distribution of N standard Brownian motions before the first collision: 

Lemma 4.8. Let {X(t))t>o be a standard Brownian motion, starting at x E 
Wn ■ Then, as t oo, the limiting distribution oft^ix{t) given that T > t 
has the density y ■|-(^(?/)Ajv(j/) on Wn, where ip is the standard Gaus- 
sian density, and Z the normalization constant. Furthermore, ¥x(T > t) = 
AN{x)t-^^^-^\C + o{l)) ast^oo for some C > 0. 

Note that the limiting distribution is of the form 1)1. l|l with replaced by 
Ajv, i.e., with (3—1. 
Sketch of proof. Lemma [4.61 yields 

F,{T>t,t--2Xit)edy) 



det 



{2ny~e 



f ^-(^.-»Vt)V(2t) 



dy 

= (27r)-fe-M^/(2*)e-"^"^/2det[( 

e-\\v\\l/2 



i,j = l,...,N 



^ll5/(2*)Ajv(z)Schur,(^), 
(27r) 2 

(4.12) 

where we put Zi = e^'/^. Now we consider the limit as t — s- oo. The second 
term is (1 + o(l)), and the continuity of Schurj, implies that the last term 
converges to Ajqiy) / An^x*). Using the approximation e^'/^ — 1 ^ Xi/\/i, we 
see that An{z) ^ t^~''^~^'^ An{x). Hence, the right hand side of H4.12|l is equal 
to ip{y)AN{y)AN{x)t-^^^-'^\l/AN{x*) +o(l)). Integrating on y e Wn, we 
obtain the last statement of the lemma. Dividing the left hand side of (|4.12|l by 
fx{T > t) and using the above asymptotics, we obtain the first one. □ 

4.4 Marginal distributions and ensembles 

We apply now the technical tools of Section 14.31 to identify the marginal dis- 
tribution of some particular Ajv-transformed processes as certain orthogonal 
polynomial ensembles. 

4-.4-1- The continuous case. 

Lemma 4.9 (Marginal distribution for A^r-transformed diffusions, 

|KO01p . Assume that I is an interval and X is a diffusion on such that 
the Vandermonde determinant A^r is harmonic for its generator and Ajv(X(t)) 
is integrable for any t > Q. Assume that there is a Taylor expansion 

Pl^^f,{x)f2{xyra„,{t), t>0,yel, 
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for X in a neighborhood of zero, where am(t) > and ft{x) > satisfy 
limj^oo am+i{t) / Omit) = and ft{0) = 1 = linit^oo ft{x). Then, for any t > 
and some suitable Ct > 0, 

lim P,(X(t)edy)=CtAAr(y)2Po(X(<)€dy), y^W^- (4.13) 

Furthermore, for any x G Wm, 

P,(T >t)^ Ajv(x)Eo [AAr(X(i))]l{x(t)6VV„}] , t ^ oo. (4.14) 

Remark 4.10. (i) Relation H4.13|l is remarkable since it provides a host of 
examples of orthogonal polynomial ensembles that appear as the marginal 
distribution of /i-transformed diffusions with h = A^r (recall that Po(X(t) G 
Ay) is a product measure). Explicit examples are the Hermite ensemble 
for Brownian motion and the Laguerre ensemble for squared Bessel pro- 
cesses, where Po(X(i) G Ay) is the Gamma distribution. Most of the other 
examples covered by Lemma |4. 91 do not seem to be explicit. 

(ii) Relation H4.7|l may be deduced from H4.14|l , if the right hand side is asymp- 
totically equivalent when t is replaced hy t — s for some s > 0. This has 
not been worked out yet in general, but can be easily seen in a couple 
of special cases. It would justify the notion 'non-colliding diffusion' for 
/i-transformed diffusions with /i = Ajv- 

(iii) A natural question is what examples (besides the Hermite ensemble, i.e., 
Brownian motions; see Section r4.1|l lead to processes that can be repre- 
sented as eigenvalue processes for suitable matrix-valued diffusions. We 
mention here the Laguerre process, the non-colliding version of squared 
Bessel processes, which is in distribution equal to the eigenvalue process 
of a natural processes of complex Wishart matrices (' |KO01| : see Re- 
mark |^fv)). We recall that the real-matrix case, which does not seem 
to admit an /i-transform interpretation, is worked out in jBr91| . 

(iv) Further important examples with physical relevance are derived in |KT04| : 
in fact, process versions of all ten classes of Gaussian random matrices 
mentioned at the beginning of Section|21are analysed, and their eigenvalue 
processes are characterised in terms of non-colliding diffusions. 

(v) In |KNT04j , independent Brownian motions are conditioned on non-collision 
up to a fixed time, S. The result is a time-inhomogeneous diffusion whose 
transition probabilities depend on S. This conditioned process converges 
towards Dyson's Brownian motions as S ^ oo. In jKTOSj . the distribu- 
tion of the conditional process is identified in terms of a certain eigen- 
value diffusion of a matrix- valued diffusion. Indeed, let (Mi(i))t>o be a 
Hermitian matrix-valued diffusion whose sub-diagonal and diagonal en- 
tries are ^N(N + 1) independent standard real Brownian motions, and 
let {M2{t))t>o be an antisymmetric matrix- valued diffusion whose sub- 
diagonal entries are ^N{N — 1) real independent Brownian bridges (i.e., 
Brownian motions conditioned on being back to the origin at time S). 
Then the eigenvalue process for the matrix Mi (t) + i M2 (t) is a realisation 
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of the above conditioned Brownian motion process, given that no coUision 
happens by time S. The matrix diffusion {Mi{t) + i Af2(i))t6[o,s] is a one- 
parameter interpolation between GUE and GOE (hence it is sometimes 
called a two-matrix model). Indeed, recall the well-known independent de- 
composition of a Brownian motion {B{t))t>o into the Brownian bridge 
{B{t) — -^B{S))tfz[o^s] and the Hnear function (^-B(5'))tg[o,S] and decom- 
pose Milt) in that way. Collecting the bridge parts of Mi(t) -I- i M2{t) 
in one process and the remaining variables in the other, we obtain the 
interpolation. 

(vi) Infinite systems of non-colliding random processes are considered in [BaOnj 
and in jKNT04j . The nearest-neighbor discrete-time case is the subject of 
|Bann| where the limiting distribution at time of the left-most walker 
is derived, conditional on a certain coupling of the total number of left- 
steps among all the walkers with N; the outcome is a certain elementary 
transformation of the Tracy- Widom distribution for GUE. In [KNTO^ . a 
system of N Brownian motions, conditional on non-collision until a fixed 
time S, is analysed in the limit N oo and S — > oo, coupled with each 
other in various ways. O 



4-4-^- The discrete case. 

We present three examples of conditioned random walks on Z^: the binomial 
random walk (leading to the Krawtchouk ensemble), the Poisson random walk 
(leading to the Charlier ensemble) and its de-Poissonized version, the multino- 
mial walk. 

For i — 1, . . . , N , lei Xi — (Xj(n))„gN(, be the binomial walk, i.e., at each 
discrete time unit the walker makes a step of size one with probability p £ (0, 1) 
or stands still otherwise. The walks Xi, . . . , are assumed independent. Under 
the A'^-dimensional process X = {Xi,.. .,Xn) starts at Xo = a; G N^. The 
A AT-transformed process on n Wn has the transition probabilities 

P,(X(n) = y) = P,(X(n) = y,T> n)^44' a;, y e n W^jv, n G N. 

(4.15) 

This marginal distribution, when the process is started at the particular site 
a;* = (0, 1, 2, . . . , A'' — 1), is identified in terms of the Krawtchouk ensemble in 
HH.8|I as follows. 

Lemma 4.11 (AAr-transformed binomial walk, |KOR02] '). Let x* = 

(0, 1, 2, . . . , TV - 1). Then, for any n G N, and y G Z^ n Wn, 

¥.j,>{X{n) = y) = KrAT (y). (4.16) 

Such an identification is known only for the particular starting point x* . The 
proof is based on the Karlin-McGregor formula and some elementary calcula- 
tions for certain determinants. 
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The Poisson random walk, Xi — {Xi(t))t>Q, on Nq makes steps of size one 
after independent exponential random times. If Xi, . . . ,Xn are independent, 
the process X = {Xi, . . . , Xn) on No makes steps after independent exponen- 
tial times of parameter N, and the steps are uniformly distributed on the set 
of the unit vectors. The embedded discrete-time walk is the so-called multi- 
nomial walk; at times 1,2,3,..., a randomly picked component makes a unit 
step. Lemma l4.4f ii) applies also here, and we may consider the Ajv-transformed 
version, both in continuous time and in discrete time. The marginal distribution 
of the discrete-time process is given in (|4.15ll . and the same formula holds true 
for the continuous-time version with n e N replaced by t > 0. 

Analogously to the binomial walk, the marginal distributions of both con- 
ditioned walks, when the process is started at cc* = (0, 1, 2, . . . , — 1), may 
be identified in terms of well-known ensembles, which we introduce first. The 
Charlier ensemble with parameter a > and iV G N is given as 

ChA..„(x) = - An{x)^T\^, xeN^nWN. (4.17) 

Za,N Xi\ 

The de-Poissonized Charlier ensemble is defined as 

dPChjv,„(a;) - ^Ajv(x)2Muw,„(x), a;eN^nVKjv,neNo,, (4.18) 

where 

I otherwise. 

Then the free multinomial random walk has the marginals Pj.(X(n) = y) = 
Mujv,„(2/- x). 

Lemma 4.12 (Conditioned Poisson and multinomial walks, |.KOR02] ^ . 

Let X* = (0,1, 2,..., TV - 1). 

(i) Let X = (X(i))t>Q be the Poisson walk, then the marginal distribution of 
the conditional process satisfies, for any i > and x g n , 

P,.(X(t) = a;) = Chjv,t(a;). (4.20) 

(i) Let X — (X(n))„gN„ be the multinomial walk, then the marginal distribu- 
tion of the conditional process satisfies, for any n G No and x € N^ n W^v, 

P,. {X{n) = x)= dVG\iN,,+N[N-i),2{x). (4.21) 

The proofs of Lemma 14.121 are based on the Karlin-McGregor formula and 
explicit calculations for certain determinants. 
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